articleJun 1, 2014Closed access

Large-Scale Video Classification with Convolutional Neural Networks

Stanford University · Google (United States)

Indexed incrossref

Abstract

Convolutional Neural Networks (CNNs) have been established as a powerful class of models for image recognition problems. Encouraged by these results, we provide an extensive empirical evaluation of CNNs on large-scale video classification using a new dataset of 1 million YouTube videos belonging to 487 classes. We study multiple approaches for extending the connectivity of a CNN in time domain to take advantage of local spatio-temporal information and suggest a multiresolution, foveated architecture as a promising way of speeding up the training. Our best spatio-temporal networks display significant performance improvements compared to strong feature-based baselines (55.3% to 63.9%), but only a surprisingly…

Citation impact

6,335
total citations
FWCI
294.33
Percentile
100%
References
38
Citations per year

Authors

6

Topics & keywords

Keywords
  • Computer science
  • Convolutional neural network
  • Artificial intelligence
  • Pattern recognition (psychology)
  • Generalization
  • Feature (linguistics)
  • Contextual image classification
  • Frame (networking)
No related works found for this paper.

Funding