Long-term recurrent convolutional networks for visual recognition and description

Donahue, Jeff; Hendricks, Lisa Anne; Guadarrama, Sergio; Rohrbach, Marcus; Venugopalan, Subhashini; Darrell, Trevor; Saenko, Kate

doi:10.1109/cvpr.2015.7298878

articleJun 1, 2015Closed access

Long-term recurrent convolutional networks for visual recognition and description

JDJeff Donahue LALisa Anne Hendricks SGSergio Guadarrama MRMarcus Rohrbach SVSubhashini Venugopalan

University of California, Berkeley · International Computer Science Institute · +2 more institutions

Indexed incrossref

Abstract

Models based on deep convolutional networks have dominated recent image interpretation tasks; we investigate whether models which are also recurrent, or “temporally deep”, are effective for tasks involving sequences, visual and otherwise. We develop a novel recurrent convolutional architecture suitable for large-scale visual learning which is end-to-end trainable, and demonstrate the value of these models on benchmark video recognition tasks, image description and retrieval problems, and video narration challenges. In contrast to current models which assume a fixed spatio-temporal receptive field or simple temporal averaging for sequential processing, recurrent convolutional models are “doubly deep” in that…

Citation impact

5,261

total citations

FWCI: 355.97
Percentile: 100%
References: 78

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Benchmark (surveying)
Deep learning
Convolutional neural network
Recurrent neural network
Pattern recognition (psychology)
Machine learning

UN Sustainable Development Goals

Quality Education

No related works found for this paper.