Performance evaluation of some clustering algorithms and validity indices
The University of Texas at Arlington · Indian Statistical Institute
Abstract
In this article, we evaluate the performance of three clustering algorithms, hard K-Means, single linkage, and a simulated annealing (SA) based technique, in conjunction with four cluster validity indices, namely Davies-Bouldin index, Dunn's index, Calinski-Harabasz index, and a recently developed index I. Based on a relation between the index I and the Dunn's index, a lower bound of the value of the former is theoretically estimated in order to get unique hard K-partition when the data set has distinct substructures. The effectiveness of the different validity indices and clustering methods in automatically evolving the appropriate number of clusters is demonstrated experimentally for both artificial and…
Citation impact
- FWCI
- 17.05
- Percentile
- 100%
- References
- 21
Authors
2Topics & keywords
- Cluster analysis
- Single-linkage clustering
- Partition (number theory)
- Computer science
- Determining the number of clusters in a data set
- Data mining
- Algorithm
- Cluster (spacecraft)