Action recognition with trajectory-pooled deep-convolutional descriptors

Wang, Limin; Qiao, Yu; Tang, Xiaoou

doi:10.1109/cvpr.2015.7299059

preprintJun 1, 2015GREEN OA

Action recognition with trajectory-pooled deep-convolutional descriptors

LWLimin Wang YQYu Qiao XTXiaoou Tang

Shenzhen Institutes of Advanced Technology · Chinese University of Hong Kong

Indexed inarxivcrossref

Abstract

Visual features are of vital importance for human action understanding in videos. This paper presents a new video representation, called trajectory-pooled deep-convolutional descriptor (TDD), which shares the merits of both hand-crafted features [31] and deep-learned features [24]. Specifically, we utilize deep architectures to learn discriminative convolutional feature maps, and conduct trajectory-constrained pooling to aggregate these convolutional features into effective descriptors. To enhance the robustness of TDDs, we design two normalization methods to transform convolutional feature maps, namely spatiotemporal normalization and channel normalization. The advantages of our features come from (i) TDDs…

Citation impact

1,223

total citations

FWCI: 103.69
Percentile: 100%
References: 63

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Discriminative model
Artificial intelligence
Computer science
Pooling
Normalization (sociology)
Convolutional neural network
Pattern recognition (psychology)
Deep learning

UN Sustainable Development Goals

Reduced inequalities

No related works found for this paper.