K‐means clustering: A half‐century synthesis

Steinley, Douglas

doi:10.1348/000711005x48266

articleBritish Journal of Mathematical and Statistical PsychologyMay 1, 2006Closed access

K‐means clustering: A half‐century synthesis

DSDouglas Steinley

University of Missouri

PubMed

Indexed incrossrefpubmed

Abstract

This paper synthesizes the results, methodology, and research conducted concerning the K-means clustering method over the last fifty years. The K-means method is first introduced, various formulations of the minimum variance loss function and alternative loss functions within the same class are outlined, and different methods of choosing the number of clusters and initialization, variable preprocessing, and data reduction schemes are discussed. Theoretic statistical results are provided and various extensions of K-means using different metrics or modifications of the original algorithm are given, leading to a unifying treatment of K-means and some of its extensions. Finally, several future studies are outlined…

Citation impact

1,100

total citations

FWCI: 18.18
Percentile: 100%
References: 238

Citations per year

Authors

1

DS
Douglas SteinleyCorresponding
University of Missouri

Topics & keywords

Topics

Keywords

Initialization
Cluster analysis
Preprocessor
Computer science
Variance (accounting)
Class (philosophy)
Reduction (mathematics)
Function (biology)

No related works found for this paper.