A survey of named entity recognition and classification

Nadeau, David R.; Sekine, Satoshi

doi:10.1075/li.30.1.03nad

articleLingvisticae InvestigationesAug 10, 2007Closed access

A survey of named entity recognition and classification

DRDavid R. Nadeau SSSatoshi Sekine

National Research Council Canada · New York University

Indexed incrossref

Abstract

This survey covers fifteen years of research in the Named Entity Recognition and Classification (NERC) field, from 1991 to 2006. We report observations about languages, named entity types, domains and textual genres studied in the literature. From the start, NERC systems have been developed using hand-made rules, but now machine learning techniques are widely used. These techniques are surveyed along with other critical aspects of NERC such as features and evaluation methods. Features are word-level, dictionary-level and corpus-level representations of words in a document. Evaluation techniques, ranging from intuitive exact match to very complex matching techniques with adjustable cost of errors, are an…

Citation impact

2,479

total citations

FWCI: 49.87
Percentile: 100%
References: 81

Citations per year

Authors

2

Topics & keywords

Topics

Keywords

Computer science
Natural language processing
Matching (statistics)
Artificial intelligence
Field (mathematics)
Information retrieval
Named entity
Key (lock)

UN Sustainable Development Goals

Quality Education

No related works found for this paper.