Temporal Action Detection with Structured Segment Networks
Chinese University of Hong Kong · ETH Zurich
Abstract
Detecting actions in untrimmed videos is an important yet challenging task. In this paper, we present the structured segment network (SSN), a novel framework which models the temporal structure of each action instance via a structured temporal pyramid. On top of the pyramid, we further introduce a decomposed discriminative model comprising two classifiers, respectively for classifying actions and determining completeness. This allows the framework to effectively distinguish positive proposals from background or incomplete ones, thus leading to both accurate recognition and localization. These components are integrated into a unified network that can be efficiently trained in an end-to-end fashion.…
Citation impact
- FWCI
- 32.25
- Percentile
- 100%
- References
- 76
Authors
6Topics & keywords
- Discriminative model
- Computer science
- Artificial intelligence
- Pyramid (geometry)
- Task (project management)
- Pattern recognition (psychology)
- Machine learning
- Action (physics)
- Reduced inequalities