Recurrent Models of Visual Attention

Mnih, Volodymyr; Heess, Nicolas; Graves, Alex; Kavukcuoglu, Koray

preprintarXiv (Cornell University)Jun 24, 2014GREEN OA

Recurrent Models of Visual Attention

VMVolodymyr Mnih NHNicolas Heess AGAlex Graves KKKoray Kavukcuoglu

Google (United States) · Google DeepMind (United Kingdom)

Abstract

Applying convolutional neural networks to large images is computationally ex-pensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is ca-pable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it per-forms can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn…

Citation impact

2,276

total citations

FWCI: —
Percentile: —
References: 22

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Convolutional neural network
Computation
Differentiable function
Pattern recognition (psychology)
Task (project management)
Image (mathematics)

No related works found for this paper.