articleJan 1, 2004GREEN OA

Active learning using pre-clustering

University of Amsterdam

Indexed incrossref

Abstract

The paper is concerned with two-class active learning. While the common approach for collecting data in active learning is to select samples close to the classification boundary, better performance can be achieved by taking into account the prior data distribution. The main contribution of the paper is a formal framework that incorporates clustering into active learning. The algorithm first constructs a classifier on the set of the cluster representatives, and then propagates the classification decision to the other samples via a local noise model. The proposed model allows to select the most representative samples as well as to avoid repeatedly labeling samples in the same cluster. During the active learning…

Citation impact

648
total citations
FWCI
14.34
Percentile
100%
References
24
Citations per year

Authors

2

Topics & keywords

Keywords
  • Cluster analysis
  • Computer science
  • Decision boundary
  • Artificial intelligence
  • Classifier (UML)
  • Machine learning
  • Data mining
  • Pattern recognition (psychology)
UN Sustainable Development Goals
  • Peace, Justice and strong institutions
No related works found for this paper.