Image Captioning with Semantic Attention

You, Quanzeng; Jin, Hailin; Wang, Zhaowen; Chen, Fang; Luo, Jiebo

doi:10.1109/cvpr.2016.503

articleJun 1, 2016Closed access

Image Captioning with Semantic Attention

QYQuanzeng You HJHailin Jin ZWZhaowen Wang FCFang Chen JLJiebo Luo

University of Rochester · Adobe Systems (United States)

Indexed incrossref

Abstract

Automatically generating a natural language description of an image has attracted interests recently both because of its importance in practical applications and because it connects two major artificial intelligence fields: computer vision and natural language processing. Existing approaches are either top-down, which start from a gist of an image and convert it into words, or bottom-up, which come up with words describing various aspects of an image and then combine them. In this paper, we propose a new algorithm that combines both approaches through a model of semantic attention. Our algorithm learns to selectively attend to semantic concept proposals and fuse them into hidden states and outputs of recurrent…

Citation impact

1,733

total citations

FWCI: 97.98
Percentile: 100%
References: 66

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Closed captioning
Fuse (electrical)
Artificial intelligence
Image (mathematics)
Semantics (computer science)
Selection (genetic algorithm)
Natural language

UN Sustainable Development Goals

Quality Education

No related works found for this paper.