Adversarial NLI: A New Benchmark for Natural Language Understanding

Nie, Yixin; Williams, Adina; Dinan, Emily; Bansal, Mohit; Weston, Jason; Kiela, Douwe

doi:10.18653/v1/2020.acl-main.441

preprintJan 1, 2020GOLD OA

Adversarial NLI: A New Benchmark for Natural Language Understanding

YNYixin Nie AWAdina Williams EDEmily Dinan MBMohit Bansal JWJason Weston

University of North Carolina Health Care · University of North Carolina at Chapel Hill · +1 more institution

Indexed incrossref

Abstract

We introduce a new large-scale NLI benchmark dataset, collected via an iterative, adversarial human-and-model-in-the-loop procedure. We show that training models on this new dataset leads to state-of-the-art performance on a variety of popular NLI benchmarks, while posing a more difficult challenge with its new test set. Our analysis sheds light on the shortcomings of current state-of-theart models, and shows that non-expert annotators are successful at finding their weaknesses. The data collection method can be applied in a never-ending learning scenario, becoming a moving target for NLU, rather than a static benchmark that will quickly saturate.

Citation impact

606

total citations

FWCI: 65.29
Percentile: 100%
References: 75

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Benchmark (surveying)
Computer science
Adversarial system
Variety (cybernetics)
Artificial intelligence
Machine learning
Set (abstract data type)
Strengths and weaknesses

UN Sustainable Development Goals

Quality Education

No related works found for this paper.