A SICK cure for the evaluation of compositional distributional semantic models
University of Trento · Fondazione Bruno Kessler
Abstract
Shared and internationally recognized benchmarks are fundamental for the development of any computational system. We aim to help the research community working on compositional distributional semantic models (CDSMs) by providing SICK (Sentences Involving Compositional Knowledge), a large-size English benchmark tailored for them. SICK consists of about 10,000 English sentence pairs that include many examples of the lexical, syntactic and semantic phenomena that CDSMs are expected to account for, but do not require dealing with other aspects of existing sentential data sets (idiomatic multiword expressions, named entities, telegraphic language) that are not within the scope of CDSMs. By means of crowdsourcing…
Citation impact
- FWCI: 49.48
- Percentile: 100%
- References: 14
Authors
- 6
Topics & keywords
- Computer science
- Natural language processing
- Textual entailment
- Sentence
- Artificial intelligence
- Task (project management)
- Logical consequence
- Meaning (existential)