Right for the Wrong Reasons: Diagnosing Syntactic Heuristics in Natural Language Inference

McCoy, Tom; Pavlick, Ellie; Linzen, Tal

doi:10.18653/v1/p19-1334

Article · Jan 1, 2019 · Gold OA

Tom McCoy · Ellie Pavlick · Tal Linzen

Johns Hopkins University · Brown University

Indexed in Crossref

Abstract

A machine learning system can score well on a given test set by relying on heuristics that are effective for frequent example types but break down in more challenging cases. We study this issue within natural language inference (NLI), the task of determining whether one sentence entails another. We hypothesize that statistical NLI models may adopt three fallible syntactic heuristics: the lexical overlap heuristic, the subsequence heuristic, and the constituent heuristic. To determine whether models have adopted these heuristics, we introduce a controlled evaluation set called HANS (Heuristic Analysis for NLI Systems), which contains many examples where the heuristics fail. We find that models trained on MNLI,…
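To make the first of the three heuristics concrete, here is a minimal Python sketch. The abstract only names the lexical overlap heuristic, so the formulation below is an assumption: predict entailment whenever every word of the hypothesis also appears in the premise. The function names and the example sentence pair are illustrative and are not taken from this record.

```python
import re


def tokens(sentence):
    """Lowercase a sentence and split it into word tokens."""
    return re.findall(r"[a-z']+", sentence.lower())


def lexical_overlap_predicts_entailment(premise, hypothesis):
    """Assumed form of the heuristic: return True (predict entailment)
    when every hypothesis word also occurs somewhere in the premise."""
    premise_words = set(tokens(premise))
    return all(word in premise_words for word in tokens(hypothesis))


# Illustrative pair: the hypothesis reuses only premise words,
# yet the premise does not entail it (the agent and patient are swapped).
premise = "The doctor was paid by the actor."
hypothesis = "The doctor paid the actor."
print(lexical_overlap_predicts_entailment(premise, hypothesis))
```

The heuristic predicts entailment for this pair even though the premise says the actor paid the doctor, which is the kind of case where such a shortcut fails.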

Citation impact

Total citations: 914

FWCI: 87.49
Percentile: 100%
References: 55

Authors: 3

Topics & keywords

Keywords

Heuristics
Computer science
Heuristic
Artificial intelligence
Natural language processing
Subsequence
Set (abstract data type)
Task (project management)

UN Sustainable Development Goals

Quality Education
