articleJan 1, 2015GOLD OA

WikiQA: A Challenge Dataset for Open-Domain Question Answering

Microsoft (United States) · Georgia Institute of Technology

Indexed incrossref

Abstract

We describe the WIKIQA dataset, a new publicly available set of question and sentence pairs, collected and annotated for research on open-domain question answering. Most previous work on answer sentence selection focuses on a dataset created using the TREC-QA data, which includes editor-generated questions and candidate answer sentences selected by matching content words in the question. WIKIQA is constructed using a more natural process and is more than an order of magnitude larger than the previous dataset. In addition, the WIKIQA dataset also includes questions for which there are no correct sentences, enabling researchers to work on answer triggering, a critical component in any QA system. We compare…

Citation impact

852
total citations
FWCI
87.73
Percentile
100%
References
11
Citations per year

Authors

3

Topics & keywords

Keywords
  • Open domain
  • Question answering
  • Computer science
  • Domain (mathematical analysis)
  • Information retrieval
  • Artificial intelligence
  • Mathematics
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.