Distance Metric Learning with Application to Clustering with Side-Information
University of California, Berkeley
Abstract
Many algorithms rely critically on being given a good metric over their inputs. For instance, data can often be clustered in many "plausible" ways, and if a clustering algorithm such as K-means initially fails to find one that is meaningful to a user, the only recourse may be for the user to manually tweak the metric until sufficiently good clusters are found. For these and other applications requiring good metrics, it is desirable that we provide a more systematic way for users to indicate what they consider "similar." For instance, we may ask them to provide examples. In this paper, we present an algorithm that, given examples of similar (and, if desired, dissimilar) pairs…
Citation impact
- FWCI
- 37.09
- Percentile
- 100%
- References
- 13
Authors
4Topics & keywords
- Cluster analysis
- Metric (unit)
- Computer science
- Data mining
- Machine learning
- Artificial intelligence