Recurrent Models of Visual Attention

Mnih, Volodymyr; Heess, Nicolas; Graves, Alex; Kavukcuoglu, Koray

doi:10.48550/arxiv.1406.6247

Preprint · arXiv (Cornell University) · Jun 24, 2014 · Green OA

DeepMind (United Kingdom) · Google (United States)

Indexed in: arXiv, DataCite

Abstract

Applying convolutional neural networks to large images is computationally expensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is capable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it performs can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn…
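As a rough illustration of the abstract's point that computation can be decoupled from input image size, the sketch below crops a fixed-size patch around a chosen location. The function name `extract_glimpse`, the zero-padding at image borders, and the 8-pixel patch size are assumptions made for illustration and are not taken from the paper. The recurrent network that chooses locations, and its reinforcement-learning training, are not shown.

```python
import numpy as np


def extract_glimpse(image, center, size):
    """Return a size x size patch of a 2-D image centred near `center`.

    Hypothetical helper: regions that fall outside the image are
    filled with zeros, so the output shape never depends on the
    image dimensions.
    """
    half = size // 2
    # Pad by `size` on every side so any in-bounds centre yields a full patch.
    padded = np.pad(image, size, mode="constant")
    row, col = center
    top = row + size - half    # shift into padded coordinates
    left = col + size - half
    return padded[top:top + size, left:left + size]


# The patch has the same shape (and downstream cost) for both image sizes.
small = np.random.rand(28, 28)
large = np.random.rand(512, 512)
glimpse_small = extract_glimpse(small, (14, 14), 8)
glimpse_large = extract_glimpse(large, (300, 40), 8)
print(glimpse_small.shape, glimpse_large.shape)
```

Whatever processes the patch sees only `size * size` values per step, so the per-step cost in this sketch is set by the patch size and the number of steps rather than by the image resolution.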

Citation impact

Total citations: 1,002

References: 26

Authors: 4

Topics & keywords

Keywords

Visual attention
Psychology
Cognitive psychology
Computer science
Perception
Neuroscience
