articleJan 1, 2008GOLD OA

An analysis of active learning strategies for sequence labeling tasks

University of Wisconsin–Madison

Indexed incrossref

Abstract

Active learning is well-suited to many problems in natural language processing, where unlabeled data may be abundant but annotation is slow and expensive. This paper aims to shed light on the best active learning approaches for sequence labeling tasks such as information extraction and document segmentation. We survey previously used query selection strategies for sequence models, and propose several novel algorithms to address their shortcomings. We also conduct a large-scale empirical comparison using multiple corpora, which demonstrates that our proposed methods advance the state of the art.

Citation impact

986
total citations
FWCI
37.01
Percentile
100%
References
36
Citations per year

Authors

2

Topics & keywords

Keywords
  • Computer science
  • Sequence labeling
  • Active learning (machine learning)
  • Annotation
  • Artificial intelligence
  • Sequence (biology)
  • Machine learning
  • Information extraction
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding