articleAug 20, 2006Closed access
Evolutionary clustering
Indexed incrossref
Abstract
We consider the problem of clustering data over time. An evolutionary clustering should simultaneously optimize two potentially conflicting criteria: first, the clustering at any point in time should remain faithful to the current data as much as possible; and second, the clustering should not shift dramatically from one timestep to the next. We present a generic framework for this problem, and discuss evolutionary versions of two widely-used clustering algorithms within this framework: k-means and agglomerative hierarchical clustering. We extensively evaluate these algorithms on real data sets and show that our algorithms can simultaneously attain both high accuracy in capturing today's data, and high…
Citation impact
657
total citations
- FWCI
- 23.30
- Percentile
- 100%
- References
- 24
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Cluster analysis
- Computer science
- CURE data clustering algorithm
- Canopy clustering algorithm
- Correlation clustering
- Single-linkage clustering
- Data stream clustering
- Data mining
No related works found for this paper.