A probabilistic framework for semi-supervised clustering
The University of Texas at Austin
Abstract
Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clusters. In recent years, a number of algorithms have been proposed for enhancing clustering quality by employing such supervision. Such methods use the constraints to either modify the objective function, or to learn the distance measure. We propose a probabilistic model for semi-supervised clustering based on Hidden Markov Random Fields (HMRFs) that provides a principled framework for incorporating supervision into prototype-based clustering. The model generalizes a previous approach that combines constraints and Euclidean distance…
Citation impact
- FWCI
- 50.88
- Percentile
- 100%
- References
- 50
Authors
3Topics & keywords
- Probabilistic logic
- Computer science
- Cluster analysis
- Artificial intelligence
- Machine learning
- Data mining