Attention on Attention for Image Captioning

Huang, Lun; Wang, Wenmin; Chen, Jie; Wei, Xiao-Yong

doi:10.1109/iccv.2019.00473

articleOct 1, 2019Closed access

Attention on Attention for Image Captioning

LHLun Huang WWWenmin Wang JCJie Chen XWXiao-Yong Wei

Peking University · Macau University of Science and Technology · +1 more institution

Indexed incrossref

Abstract

Attention mechanisms are widely used in current encoder/decoder frameworks of image captioning, where a weighted average on encoded vectors is generated at each time step to guide the caption decoding process. However, the decoder has little idea of whether or how well the attended vector and the given attention query are related, which could make the decoder give misled results. In this paper, we propose an Attention on Attention (AoA) module, which extends the conventional attention mechanisms to determine the relevance between attention results and queries. AoA first generates an information vector and an attention gate using the attention result and the current context, then adds another attention by…

Citation impact

1,006

total citations

FWCI: 48.38
Percentile: 100%
References: 74

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Closed captioning
Computer science
Context (archaeology)
Encoder
Decoding methods
Relevance (law)
Code (set theory)
Image (mathematics)

No related works found for this paper.