articleAug 22, 2004Closed access

A probabilistic framework for semi-supervised clustering

The University of Texas at Austin

Indexed incrossref

Abstract

Unsupervised clustering can be significantly improved using supervision in the form of pairwise constraints, i.e., pairs of instances labeled as belonging to same or different clusters. In recent years, a number of algorithms have been proposed for enhancing clustering quality by employing such supervision. Such methods use the constraints to either modify the objective function, or to learn the distance measure. We propose a probabilistic model for semi-supervised clustering based on Hidden Markov Random Fields (HMRFs) that provides a principled framework for incorporating supervision into prototype-based clustering. The model generalizes a previous approach that combines constraints and Euclidean distance…

Citation impact

818
total citations
FWCI
50.88
Percentile
100%
References
50
Citations per year

Authors

3

Topics & keywords

Keywords
  • Probabilistic logic
  • Computer science
  • Cluster analysis
  • Artificial intelligence
  • Machine learning
  • Data mining
No related works found for this paper.

Funding