Subspace clustering for high dimensional data

Parsons, Lance; Haque, Ehtesham; Liu, Huan

doi:10.1145/1007730.1007731

articleACM SIGKDD Explorations NewsletterJun 1, 2004Closed access

Subspace clustering for high dimensional data

LPLance Parsons EHEhtesham Haque HLHuan Liu

Arizona State University

Indexed incrossref

Abstract

Subspace clustering is an extension of traditional clustering that seeks to find clusters in different subspaces within a dataset. Often in high dimensional data, many dimensions are irrelevant and can mask existing clusters in noisy data. Feature selection removes irrelevant and redundant dimensions by analyzing the entire dataset. Subspace clustering algorithms localize the search for relevant dimensions allowing them to find clusters that exist in multiple, possibly overlapping subspaces. There are two major branches of subspace clustering based on their search strategy. Top-down algorithms find an initial clustering in the full set of dimensions and evaluate the subspaces of each cluster, iteratively…

Citation impact

1,339

total citations

FWCI: 48.10
Percentile: 100%
References: 87

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Cluster analysis
Computer science
Linear subspace
Clustering high-dimensional data
Correlation clustering
Data mining
CURE data clustering algorithm
Subspace topology

No related works found for this paper.