Multi-view clustering via canonical correlation analysis
University of California, San Diego · Toyota Technological Institute at Chicago
Abstract
Clustering data in high-dimensions is believed to be a hard problem in general. A number of efficient clustering algorithms developed in recent years address this problem by projecting the data into a lower-dimensional subspace, e.g. via Principal Components Analysis (PCA) or random projections, before clustering. Such techniques typically require stringent requirements on the separation between the cluster means (in order for the algorithm to be be successful).\nHere, we show how using multiple views of the data can relax these stringent requirements. We use Canonical Correlation Analysis (CCA) to project the data in each view to a lower-dimensional subspace. Under the assumption that conditioned on the…
Citation impact
- FWCI
- 28.07
- Percentile
- 100%
- References
- 21
Authors
4Topics & keywords
- Canonical correlation
- Cluster analysis
- Computer science
- Correlation
- Artificial intelligence
- Mathematics
- Quality Education