Preprint · arXiv (Cornell University) · Jan 9, 2019 · Green OA

Bootstrapping a Data-Set and Model for Question-Answering in Portuguese (Short Paper)

Dai, Zihang · Yang, Zhilin · Yang, Yiming · Carbonell, Jaime · Le, Quoc V.

Centre National de la Recherche Scientifique · Institut national de recherche en sciences et technologies du numérique · +3 more institutions

Indexed in: arXiv, DataCite

Abstract

Question answering systems are mainly concerned with fulfilling an information query written in natural language, given a collection of documents with relevant information. They are key elements in many popular application systems, such as personal assistants, chat-bots, or even FAQ-based online support systems. This paper describes exploratory work carried out to develop a state-of-the-art model for question-answering tasks in the Portuguese language, based on deep neural networks. We also describe the automatic construction of a data-set for training and testing the model. The final model is not trained on any specific topic or context, and is able to handle generic documents, achieving 50% accuracy in…

Citation impact

Total citations: 594
FWCI: 65.87
Percentile: 100%
References: 60

Authors

6
  • Dai, Zihang (Corresponding)

    Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University

  • Yang, Zhilin

    Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University

  • Yang, Yiming

    Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University

  • Carbonell, Jaime
  • Le, Quoc V.

Topics & keywords

Keywords
  • Perplexity
  • Computer science
  • Language model
  • Transformer
  • Hyperparameter
  • Artificial intelligence
  • Treebank
  • Natural language processing
UN Sustainable Development Goals
  • Quality Education

Funding