preprintarXiv (Cornell University)Jun 24, 2014GREEN OA

Recurrent Models of Visual Attention

Google (United States) · Google DeepMind (United Kingdom)

Abstract

Applying convolutional neural networks to large images is computationally ex-pensive because the amount of computation scales linearly with the number of image pixels. We present a novel recurrent neural network model that is ca-pable of extracting information from an image or video by adaptively selecting a sequence of regions or locations and only processing the selected regions at high resolution. Like convolutional neural networks, the proposed model has a degree of translation invariance built-in, but the amount of computation it per-forms can be controlled independently of the input image size. While the model is non-differentiable, it can be trained using reinforcement learning methods to learn…

Citation impact

2,276
total citations
FWCI
Percentile
References
22
Citations per year

Authors

4

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Convolutional neural network
  • Computation
  • Differentiable function
  • Pattern recognition (psychology)
  • Task (project management)
  • Image (mathematics)
No related works found for this paper.