Learning Self-Growth Maps for Fast and Accurate Imbalanced Streaming Data Clustering
Guangdong University of Technology · Hong Kong Baptist University
Abstract
Streaming data clustering is a popular research topic in data mining and machine learning. Since streaming data is usually analyzed in data chunks, it is more susceptible to encountering the dynamic cluster imbalance issue. That is, the imbalance ratio (IR) of clusters changes over time, which can easily lead to fluctuations in either the accuracy or the efficiency of streaming data clustering. Therefore, an accurate and efficient streaming data clustering approach is proposed to adapt to the drifting and imbalanced cluster distributions. We first design a self-growth map (SGM) that can automatically arrange neurons on demand according to local distribution, and thus achieve fast and incremental adaptation to…
Citation impact
- FWCI
- 127.81
- Percentile
- 100%
- References
- 61
Authors
8- YZYiqun ZhangCorresponding
Guangdong University of Technology
- SFSen Feng
Guangdong University of Technology
- PWPengkai Wang
Guangdong University of Technology
- ZTZexi Tan
Guangdong University of Technology
- XLXiaopeng Luo
Guangdong University of Technology
Topics & keywords
- Cluster analysis
- Computer science
- Streaming data
- Artificial intelligence
- Data mining