Clustering data streams: theory and practice
University of Pennsylvania · Carnegie Mellon University · +3 more institutions
Indexed incrossref
Abstract
The data stream model has recently attracted attention for its applicability to numerous types of data, including telephone records, Web documents, and clickstreams. For analysis of such data, the ability to process the data in a single pass, or a small number of passes, while using little memory, is crucial. We describe such a streaming algorithm that effectively clusters large data streams. We also provide empirical evidence of the algorithm's performance on synthetic and real data streams.
Citation impact
900
total citations
- FWCI
- 22.23
- Percentile
- 100%
- References
- 109
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Computer science
- Data stream mining
- Cluster analysis
- Data mining
- Data stream
- Data stream clustering
- STREAMS
- Process (computing)
No related works found for this paper.