articleJul 1, 2017Closed access

Temporal Convolutional Networks for Action Segmentation and Detection

Johns Hopkins University

Indexed incrossref

Abstract

The ability to identify and temporally segment fine-grained human actions throughout a video is crucial for robotics, surveillance, education, and beyond. Typical approaches decouple this problem by first extracting local spatiotemporal features from video frames and then feeding them into a temporal classifier that captures high-level temporal patterns. We describe a class of temporal models, which we call Temporal Convolutional Networks (TCNs), that use a hierarchy of temporal convolutions to perform fine-grained action segmentation or detection. Our Encoder-Decoder TCN uses pooling and upsampling to efficiently capture long-range temporal patterns whereas our Dilated TCN uses dilated convolutions. We show…

Citation impact

2,014
total citations
FWCI
33.09
Percentile
100%
References
45
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Upsampling
  • Artificial intelligence
  • Segmentation
  • Pooling
  • Convolutional neural network
  • Classifier (UML)
  • Pattern recognition (psychology)
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.