articleAug 20, 2006Closed access

Evolutionary clustering

Yahoo (United States)

Indexed incrossref

Abstract

We consider the problem of clustering data over time. An evolutionary clustering should simultaneously optimize two potentially conflicting criteria: first, the clustering at any point in time should remain faithful to the current data as much as possible; and second, the clustering should not shift dramatically from one timestep to the next. We present a generic framework for this problem, and discuss evolutionary versions of two widely-used clustering algorithms within this framework: k-means and agglomerative hierarchical clustering. We extensively evaluate these algorithms on real data sets and show that our algorithms can simultaneously attain both high accuracy in capturing today's data, and high…

Citation impact

657
total citations
FWCI
23.30
Percentile
100%
References
24
Citations per year

Authors

3

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • CURE data clustering algorithm
  • Canopy clustering algorithm
  • Correlation clustering
  • Single-linkage clustering
  • Data stream clustering
  • Data mining
No related works found for this paper.