P-CNN: Pose-Based CNN Features for Action Recognition

Chéron, Guilhem; Laptev, Ivan; Schmid, Cordelia

doi:10.1109/iccv.2015.368

preprintDec 1, 2015GREEN OA

P-CNN: Pose-Based CNN Features for Action Recognition

GCGuilhem Chéron ILIvan Laptev CSCordelia Schmid

Université Grenoble Alpes · Institut national de recherche en informatique et en automatique · +5 more institutions

Indexed inarxivcrossref

Abstract

This work targets human action recognition in video. While recent methods typically represent actions by statistics of local video features, here we argue for the importance of a representation derived from human pose. To this end we propose a new Pose-based Convolutional Neural Network descriptor (P-CNN) for action recognition. The descriptor aggregates motion and appearance information along tracks of human body parts. We investigate different schemes of temporal aggregation and experiment with P-CNN features obtained both for automatically estimated and manually annotated human poses. We evaluate our method on the recent and challenging JHMDB and MPII Cooking datasets. For both datasets our method shows…

Citation impact

638

total citations

FWCI: 38.56
Percentile: 100%
References: 59

Citations per year

Authors

3

Topics & keywords

Topics

Keywords

Convolutional neural network
Computer science
Action recognition
Artificial intelligence
Representation (politics)
Pattern recognition (psychology)
Action (physics)
Motion (physics)

No related works found for this paper.