NbClust: An R Package for Determining the Relevant Number of Clusters in a Data Set

Centre d'Etudes et De Recherche en Informatique et Communications

Indexed in DOAJ

Abstract

Clustering is the partitioning of a set of objects into groups (clusters) so that objects within a group are more similar to each other than to objects in different groups. Most clustering algorithms depend on some assumptions in order to define the subgroups present in a data set. As a consequence, the resulting clustering scheme requires some sort of evaluation of its validity. The evaluation procedure has to tackle difficult problems such as the quality of clusters, the degree to which a clustering scheme fits a specific data set, and the optimal number of clusters in a partitioning. In the literature, a wide variety of indices have been proposed to find the optimal number of clusters in a…

Citation impact

  • Total citations: 1,606
  • FWCI: 88.82
  • Percentile: 100%
  • References: 51

Authors

4

Topics & keywords

Keywords
  • Set (abstract data type)
  • R package
  • Data set
  • Computer science
  • Artificial intelligence
  • Programming language