Large-Scale Named Entity Disambiguation Based on Wikipedia Data

Cucerzan, Silviu

Article | Jun 1, 2007 | Closed access

Abstract

This paper presents a large-scale system for the recognition and semantic disambiguation of named entities based on information extracted from a large encyclopedic collection and Web search results. It describes in detail the disambiguation paradigm employed and the information extraction process from Wikipedia. Through a process of maximizing the agreement between the contextual information extracted from Wikipedia and the context of a document, as well as the agreement among the category tags associated with the candidate entities, the implemented system shows high disambiguation accuracy on both news stories and Wikipedia articles.
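The agreement-maximization idea in the abstract can be illustrated with a minimal sketch. Everything in it is an assumption made for illustration: the toy knowledge base `KB`, the `CANDIDATES` table, the set-overlap scoring, and the exhaustive search over joint assignments are hypothetical stand-ins, not the paper's actual features, scoring formula, or search procedure.

```python
from itertools import combinations, product

# Hypothetical toy knowledge base: each entity has context words and category tags.
KB = {
    "Space Shuttle Columbia": {
        "contexts": {"nasa", "shuttle", "orbit", "mission"},
        "categories": {"Space Shuttles"},
    },
    "Columbia University": {
        "contexts": {"university", "york", "campus"},
        "categories": {"Universities"},
    },
    "Space Shuttle Discovery": {
        "contexts": {"nasa", "shuttle", "launch"},
        "categories": {"Space Shuttles"},
    },
    "Discovery Channel": {
        "contexts": {"television", "channel", "cable"},
        "categories": {"Television networks"},
    },
}

# Hypothetical mapping from surface forms to candidate entities.
CANDIDATES = {
    "Columbia": ["Space Shuttle Columbia", "Columbia University"],
    "Discovery": ["Space Shuttle Discovery", "Discovery Channel"],
}


def score(assignment, doc_words):
    """Context agreement with the document plus category agreement among entities."""
    context = sum(len(KB[e]["contexts"] & doc_words) for e in assignment)
    category = sum(
        len(KB[a]["categories"] & KB[b]["categories"])
        for a, b in combinations(assignment, 2)
    )
    return context + category


def disambiguate(mentions, doc_words):
    """Pick the joint assignment with the highest score (brute force, toy scale only)."""
    options = [CANDIDATES[m] for m in mentions]
    best = max(product(*options), key=lambda a: score(a, doc_words))
    return dict(zip(mentions, best))


doc = {"nasa", "said", "shuttle", "columbia", "discovery", "launch"}
result = disambiguate(["Columbia", "Discovery"], doc)
```

On this toy document both mentions resolve to the shuttle entities, because those candidates share context words with the document and share a category tag with each other. Brute-force enumeration grows exponentially with the number of mentions, so it is suitable only for illustrating the objective.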

Citation impact

Total citations: 1,076
FWCI: 70.69
Percentile: 100%
References: 20

Authors

Silviu Cucerzan (corresponding author)
Microsoft Research (United Kingdom)

Topics & keywords

Keywords

Computer science
Information retrieval
Context (archaeology)
Process (computing)
Information extraction
Scale (ratio)
Natural language processing
Entity linking

UN Sustainable Development Goals

Quality Education