Attention on Attention for Image Captioning
Peking University · Macau University of Science and Technology · +1 more institution
Abstract
Attention mechanisms are widely used in current encoder/decoder frameworks of image captioning, where a weighted average on encoded vectors is generated at each time step to guide the caption decoding process. However, the decoder has little idea of whether or how well the attended vector and the given attention query are related, which could make the decoder give misled results. In this paper, we propose an Attention on Attention (AoA) module, which extends the conventional attention mechanisms to determine the relevance between attention results and queries. AoA first generates an information vector and an attention gate using the attention result and the current context, then adds another attention by…
Citation impact
- FWCI
- 48.38
- Percentile
- 100%
- References
- 74
Authors
4Topics & keywords
- Closed captioning
- Computer science
- Context (archaeology)
- Encoder
- Decoding methods
- Relevance (law)
- Code (set theory)
- Image (mathematics)