Diffusion maps and coarse-graining: a unified framework for dimensionality reduction, graph partitioning, and data set parameterization

Google (United States) · Carnegie Mellon University

PubMed
Indexed incrossrefpubmed

Abstract

We provide evidence that nonlinear dimensionality reduction, clustering, and data set parameterization can be solved within one and the same framework. The main idea is to define a system of coordinates with an explicit metric that reflects the connectivity of a given data set and that is robust to noise. Our construction, which is based on a Markov random walk on the data, offers a general scheme of simultaneously reorganizing and subsampling graphs and arbitrarily shaped data sets in high dimensions using intrinsic geometry. We show that clustering in embedding spaces is equivalent to compressing operators. The objective of data partitioning and clustering is to coarse-grain the random walk on the data while…

Citation impact

629
total citations
FWCI
36.40
Percentile
100%
References
29
Citations per year

Authors

2

Topics & keywords

Keywords
  • Diffusion map
  • Cluster analysis
  • Dimensionality reduction
  • Random walk
  • Mathematics
  • Algorithm
  • Computer science
  • Nonlinear dimensionality reduction
No related works found for this paper.