Bootstrapping a Data-Set and Model for Question-Answering in Portuguese (Short Paper)
Centre National de la Recherche Scientifique · Institut national de recherche en sciences et technologies du numérique · +3 more institutions
Abstract
Question answering systems are mainly concerned with fulfilling an information query written in natural language, given a collection of documents with relevant information. They are key elements in many popular applications, such as personal assistants, chat-bots, and FAQ-based online support systems. This paper describes exploratory work carried out to build a state-of-the-art model, based on deep neural networks, for question-answering tasks in the Portuguese language. We also describe the automatic construction of a data-set for training and testing the model. The final model is not trained on any specific topic or context and is able to handle generic documents, achieving 50% accuracy in…
Citation impact
- FWCI: 65.87
- Percentile: 100%
- References: 60
Authors (6)
- Dai, Zihang (Corresponding)
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University
- Yang, Zhilin
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University
- Yang, Yiming
Centre National de la Recherche Scientifique, Institut national de recherche en sciences et technologies du numérique, Institut de Recherche en Informatique et Systèmes Aléatoires, Université de Rennes, Carnegie Mellon University
- Carbonell, Jaime
- Le, Quoc V.
Topics & keywords
- Perplexity
- Computer science
- Language model
- Transformer
- Hyperparameter
- Artificial intelligence
- Treebank
- Natural language processing
- Quality Education