Beyond short snippets: Deep networks for video classification
University of Maryland, College Park · The University of Texas at Austin · +1 more institution
Abstract
Convolutional neural networks (CNNs) have been extensively applied for image recognition problems giving state-of-the-art results on recognition, detection, segmentation and retrieval. In this work we propose and evaluate several deep neural network architectures to combine image information across a video over longer time periods than previously attempted. We propose two methods capable of handling full length videos. The first method explores various convolutional temporal feature pooling architectures, examining the various design choices which need to be made when adapting a CNN for this task. The second proposed method explicitly models the video as an ordered sequence of frames. For this purpose we…
Citation impact
- FWCI
- 108.93
- Percentile
- 100%
- References
- 40
Authors
6Topics & keywords
- Computer science
- Pooling
- Convolutional neural network
- Artificial intelligence
- Optical flow
- Deep learning
- Pattern recognition (psychology)
- Segmentation