Identifying Relations for Open Information Extraction

Fader, Anthony; Soderland, Stephen; Etzioni, Oren

Article · Jul 27, 2011 · Closed access

Abstract

Open Information Extraction (IE) is the task of extracting assertions from massive corpora without requiring a pre-specified vocabulary. This paper shows that the output of state-of-the-art Open IE systems is rife with uninformative and incoherent extractions. To overcome these problems, we introduce two simple syntactic and lexical constraints on binary relations expressed by verbs. We implemented the constraints in the REVERB Open IE system, which more than doubles the area under the precision-recall curve relative to previous extractors such as TEXTRUNNER and WOE^pos. More than 30% of REVERB’s extractions are at precision 0.8 or higher, compared to virtually none for earlier systems. The paper concludes…
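The abstract does not spell out the constraints. As a rough illustration only, the sketch below assumes the syntactic constraint takes the form of a part-of-speech pattern: a verb, optionally followed by intervening nouns, adjectives, adverbs, pronouns, or determiners and then a preposition, particle, or infinitive marker, with adjacent matches merged. The one-letter tag classes, the `coarse` and `relation_phrases` helpers, and the hand-tagged sentences are all hypothetical simplifications, not REVERB's actual implementation. The lexical constraint is not modeled here.

```python
import re

def coarse(tag):
    """Map a Penn Treebank tag to a one-letter class (assumed simplification)."""
    if tag.startswith("VB"):
        return "V"  # any verb form
    if tag in ("IN", "RP", "TO"):
        return "P"  # preposition, particle, infinitive marker
    if tag.startswith(("NN", "JJ", "RB", "PRP", "DT")):
        return "W"  # noun, adjective, adverb, pronoun, determiner
    return "O"      # anything else

# Assumed pattern: V, optionally followed by W* P; adjacent matches are merged.
PATTERN = re.compile(r"(?:V(?:W*P)?)+")

def relation_phrases(tagged):
    """Return candidate relation phrases from a list of (token, tag) pairs."""
    letters = "".join(coarse(tag) for _, tag in tagged)
    return [
        " ".join(token for token, _ in tagged[m.start():m.end()])
        for m in PATTERN.finditer(letters)
    ]

# Hand-tagged example sentences (tags supplied manually, not by a tagger).
faust = [("Faust", "NNP"), ("made", "VBD"), ("a", "DT"), ("deal", "NN"),
         ("with", "IN"), ("the", "DT"), ("devil", "NN")]
wants = [("He", "PRP"), ("wants", "VBZ"), ("to", "TO"), ("extend", "VB"),
         ("the", "DT"), ("deal", "NN")]

print(relation_phrases(faust))  # ['made a deal with']
print(relation_phrases(wants))  # ['wants to extend']
```

Matching the longer phrase "made a deal with" rather than the bare verb "made" is the kind of behavior that would avoid the uninformative extractions the abstract mentions.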

Citation impact

Total citations: 1,152

FWCI: 109.20
Percentile: 100%
References: 30

Authors: 3

Topics & keywords

Keywords

Computer science
Task (project management)
Information extraction
Natural language processing
Simple (philosophy)
Vocabulary
Precision and recall
Binary number

UN Sustainable Development Goals

Quality Education
