articleACM SIGKDD Explorations NewsletterJun 1, 2004Closed access

Subspace clustering for high dimensional data

Arizona State University

Indexed incrossref

Abstract

Subspace clustering is an extension of traditional clustering that seeks to find clusters in different subspaces within a dataset. Often in high dimensional data, many dimensions are irrelevant and can mask existing clusters in noisy data. Feature selection removes irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clustering algorithms localize the search for relevant dimensions allowing them to find clusters that exist in multiple, possibly overlapping subspaces. There are two major branches of subspace clustering based on their search strategy. Top-down algorithms find an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, iteratively…

Citation impact

1,339
total citations
FWCI
48.10
Percentile
100%
References
87
Citations per year

Authors

3

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • Linear subspace
  • Clustering high-dimensional data
  • Correlation clustering
  • Data mining
  • CURE data clustering algorithm
  • Subspace topology
No related works found for this paper.