articleJun 1, 2015Closed access

Long-term recurrent convolutional networks for visual recognition and description

University of California, Berkeley · International Computer Science Institute · +2 more institutions

Indexed incrossref

Abstract

Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent, or “temporally deep”, are effective for tasks involving sequences, visual and otherwise. We develop a novel recurrent convolutional architecture suitable for large-scale visual learning which is end-to-end trainable, and demonstrate the value of these models on benchmark video recognition tasks, image description and retrieval problems, and video narration challenges. In contrast to current models which assume a fixed spatio-temporal receptive field or simple temporal averaging for sequential processing, recurrent convolutional models are “doubly deep” in that…

Citation impact

5,261
total citations
FWCI
355.97
Percentile
100%
References
78
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Benchmark (surveying)
  • Deep learning
  • Convolutional neural network
  • Recurrent neural network
  • Pattern recognition (psychology)
  • Machine learning
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.

Funding