A large annotated corpus for learning natural language inference

Bowman, Samuel R.; Angeli, Gabor; Potts, Christopher; Manning, Christopher D.

doi:10.18653/v1/d15-1075

articleJan 1, 2015GOLD OA

A large annotated corpus for learning natural language inference

SRSamuel R. Bowman GAGabor Angeli CPChristopher Potts CDChristopher D. Manning

Stanford University

Indexed incrossref

Abstract

Understanding entailment and contradiction is fundamental to understanding natural language, and inference about entailment and contradiction is a valuable testing ground for the development of semantic representations. However, machine learning research in this area has been dramatically limited by the lack of large-scale resources. To address this, we introduce the Stanford Natural Language Inference corpus, a new, freely available collection of labeled sentence pairs, written by humans doing a novel grounded task based on image captioning. At 570K pairs, it is two orders of magnitude larger than all other resources of its type. This increase in scale allows lexicalized classifiers to outperform some…

Citation impact

3,428

total citations

FWCI: 70.93
Percentile: 100%
References: 31

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Natural language processing
Artificial intelligence
Inference
Natural language
Natural (archaeology)
Geology

UN Sustainable Development Goals

Quality Education

No related works found for this paper.