Image Captioning with Semantic Attention
University of Rochester · Adobe Systems (United States)
Abstract
Automatically generating a natural language description of an image has attracted interests recently both because of its importance in practical applications and because it connects two major artificial intelligence fields: computer vision and natural language processing. Existing approaches are either top-down, which start from a gist of an image and convert it into words, or bottom-up, which come up with words describing various aspects of an image and then combine them. In this paper, we propose a new algorithm that combines both approaches through a model of semantic attention. Our algorithm learns to selectively attend to semantic concept proposals and fuse them into hidden states and outputs of recurrent…
Citation impact
- FWCI
- 97.98
- Percentile
- 100%
- References
- 66
Authors
5Topics & keywords
- Computer science
- Closed captioning
- Fuse (electrical)
- Artificial intelligence
- Image (mathematics)
- Semantics (computer science)
- Selection (genetic algorithm)
- Natural language
- Quality Education