Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons

McCallum, Andrew; Li, Wei

doi:10.3115/1119176.1119206

articleJan 1, 2003GOLD OA

Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons

AMAndrew McCallum WLWei Li

University of Massachusetts Amherst

Indexed incrossref

Abstract

Models for many natural language tasks benefit from the flexibility to use overlapping, non-independent features. For example, the need for labeled data can be drastically reduced by taking advantage of domain knowledge in the form of word lists, part-of-speech tags, character n-grams, and capitalization patterns. While it is difficult to capture such inter-dependent features with a generative probabilistic model, conditionally-trained models, such as conditional maximum entropy models, handle them well. There has been significant work with such models for greedy sequence modeling in NLP (Ratnaparkhi, 1996; Borthwick et al., 1998).

Citation impact

1,159

total citations

FWCI: 32.34
Percentile: 100%
References: 10

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Conditional random field
Computer science
Artificial intelligence
Natural language processing
Probabilistic logic
Principle of maximum entropy
Language model
Generative grammar

No related works found for this paper.

Funding

DA
Defense Advanced Research Projects Agency