Model-Based Clustering, Discriminant Analysis, and Density Estimation

Fraley, Chris; Raftery, Adrian E.

doi:10.1198/016214502760047131

articleJournal of the American Statistical AssociationJun 1, 2002Closed access

Model-Based Clustering, Discriminant Analysis, and Density Estimation

CFChris Fraley AEAdrian E. Raftery

Walsh University

Indexed incrossref

Abstract

Cluster analysis is the automated search for groups of related observations in a dataset. Most clustering done in practice is based largely on heuristic but intuitively reasonable procedures, and most clustering methods available in commercial software are also of this type. However, there is little systematic guidance associated with these methods for solving important practical questions that arise in cluster analysis, such as how many clusters are there, which clustering method should be used, and how should outliers be handled. We review a general methodology for model-based clustering that provides a principled statistical approach to these issues. We also show that this can be useful for other problems…

Citation impact

4,259

total citations

FWCI: 63.53
Percentile: 100%
References: 144

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Cluster analysis
Computer science
Data mining
Artificial intelligence
Outlier
Heuristic
Clustering high-dimensional data
Linear discriminant analysis

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.