NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set
Centre d'Etudes et De Recherche en Informatique et Communications
Abstract
Clustering is the partitioning of a set of objects into groups (clusters) so that objects within a group are more similar to each others than objects in different groups. Most of the clustering algorithms depend on some assumptions in order to define the subgroups present in a data set. As a consequence, the resulting clustering scheme requires some sort of evaluation as regards its validity.The evaluation procedure has to tackle difficult problems such as the quality of clusters, the degree with which a clustering scheme fits a specific data set and the optimal number of clusters in a partitioning. In the literature, a wide variety of indices have been proposed to find the optimal number of clusters in a…
Citation impact
- FWCI
- 88.82
- Percentile
- 100%
- References
- 51
Authors
4Topics & keywords
- Set (abstract data type)
- R package
- Data set
- Computer science
- Artificial intelligence
- Programming language