articleJun 19, 2011Closed access

Knowledge-Based Weak Supervision for Information Extraction of Overlapping Relations

University of Washington

Abstract

Information extraction (IE) holds the promise of generating a large-scale knowledge base from the Web’s natural language text. Knowledge-based weak supervision, using structured data to heuristically label a training corpus, works towards this goal by enabling the automated learning of a potentially unbounded number of relation extractors. Recently, researchers have developed multiinstance learning algorithms to combat the noisy training data that can come from heuristic labeling, but their models assume relations are disjoint — for example they cannot extract the pair Founded(Jobs, Apple) and CEO-of(Jobs, Apple). This paper presents a novel approach for multi-instance learning with overlapping relations that…

Citation impact

922
total citations
FWCI
47.36
Percentile
100%
References
22
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Relationship extraction
  • Disjoint sets
  • Knowledge base
  • Sentence
  • Aggregate (composite)
  • Heuristic
  • Information extraction
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.