articleACM Transactions on Knowledge Discovery from DataMar 1, 2009Closed access

Clustering high-dimensional data

Ludwig-Maximilians-Universität München

Indexed incrossref

Abstract

As a prolific research area in data mining, subspace clustering and related problems induced a vast quantity of proposed solutions. However, many publications compare a new proposition—if at all—with one or two competitors, or even with a so-called “naïve” ad hoc solution, but fail to clarify the exact problem definition. As a consequence, even if two solutions are thoroughly compared experimentally, it will often remain unclear whether both solutions tackle the same problem or, if they do, whether they agree in certain tacit assumptions and how such assumptions may influence the outcome of an algorithm. In this survey, we try to clarify: (i) the different problem definitions related to subspace clustering in…

Citation impact

1,086
total citations
FWCI
97.34
Percentile
100%
References
140
Citations per year

Authors

3

Topics & keywords

Keywords
  • Cluster analysis
  • Heuristics
  • Computer science
  • Proposition
  • Subspace topology
  • Basis (linear algebra)
  • Outcome (game theory)
  • Clustering high-dimensional data
No related works found for this paper.