articleJan 1, 2003GOLD OA

Early results for named entity recognition with conditional random fields, feature induction and web-enhanced lexicons

University of Massachusetts Amherst

Indexed incrossref

Abstract

Models for many natural language tasks benefit from the flexibility to use overlapping, non-independent features. For example, the need for labeled data can be drastically reduced by taking advantage of domain knowledge in the form of word lists, part-of-speech tags, character n-grams, and capitalization patterns. While it is difficult to capture such inter-dependent features with a generative probabilistic model, conditionally-trained models, such as conditional maximum entropy models, handle them well. There has been significant work with such models for greedy sequence modeling in NLP (Ratnaparkhi, 1996; Borthwick et al., 1998).

Citation impact

1,159
total citations
FWCI
32.34
Percentile
100%
References
10
Citations per year

Authors

2

Topics & keywords

Keywords
  • Conditional random field
  • Computer science
  • Artificial intelligence
  • Natural language processing
  • Probabilistic logic
  • Principle of maximum entropy
  • Language model
  • Generative grammar
No related works found for this paper.

Funding