articleIEEE Transactions on Knowledge and Data EngineeringNov 1, 2005Closed access

Document clustering using locality preserving indexing

University of Illinois Urbana-Champaign · University of Chicago

Indexed incrossref

Abstract

We propose a novel document clustering method which aims to cluster the documents into different semantic classes. The document space is generally of high dimensionality and clustering in such a high dimensional space is often infeasible due to the curse of dimensionality. By using locality preserving indexing (LPI), the documents can be projected into a lower-dimensional semantic space in which the documents related to the same semantics are close to each other. Different from previous document clustering methods based on latent semantic indexing (LSI) or nonnegative matrix factorization (NMF), our method tries to discover both the geometric and discriminating structures of the document space. Theoretical…

Citation impact

796
total citations
FWCI
9.87
Percentile
100%
References
40
Citations per year

Authors

3

Topics & keywords

Keywords
  • Document clustering
  • Computer science
  • Locality
  • Cluster analysis
  • Search engine indexing
  • Non-negative matrix factorization
  • Curse of dimensionality
  • Artificial intelligence
UN Sustainable Development Goals
  • Reduced inequalities
No related works found for this paper.