Attention in Natural Language Processing
University of Bologna · University of Modena and Reggio Emilia
Abstract
Attention is an increasingly popular mechanism used in a wide range of neural architectures. The mechanism itself has been realized in a variety of formats. However, because of the fast-paced advances in this domain, a systematic overview of attention is still missing. In this article, we define a unified model for attention architectures in natural language processing, with a focus on those designed to work with vector representations of the textual data. We propose a taxonomy of attention models according to four dimensions: the representation of the input, the compatibility function, the distribution function, and the multiplicity of the input and/or output. We present the examples of how prior information…
Citation impact
- FWCI
- 46.52
- Percentile
- 100%
- References
- 257
Authors
3Topics & keywords
- Computer science
- Categorization
- Artificial intelligence
- Variety (cybernetics)
- Natural language
- Natural language understanding
- Data science
- Natural language processing
- Quality Education