Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms
Florida Institute of Technology
Abstract
Many clustering and segmentation algorithms both suffer from the limitation that the number of clusters/segments is specified by a human user. It is often impractical to expect a human with sufficient domain knowledge to be available to select the number of clusters/segments to return. We investigate techniques to determine the number of clusters or segments to return from hierarchical clustering and segmentation algorithms. We propose an efficient algorithm, the L method that finds the "knee" in a '# of clusters vs. clustering evaluation metric' graph. Using the knee is well-known, but is not a particularly well-understood method to determine the number of clusters. We explore the feasibility of this method,…
Citation impact
- FWCI
- 30.32
- Percentile
- 100%
- References
- 30
Authors
2Topics & keywords
- Cluster analysis
- Computer science
- Segmentation
- Hierarchical clustering
- Single-linkage clustering
- Correlation clustering
- Determining the number of clusters in a data set
- Data mining