UntrimmedNets for Weakly Supervised Action Recognition and Detection
ETH Zurich · Chinese University of Hong Kong
Abstract
Current action recognition methods heavily rely on trimmed videos for model training. However, it is expensive and time-consuming to acquire a large-scale trimmed video dataset. This paper presents a new weakly supervised architecture, called UntrimmedNet, which is able to directly learn action recognition models from untrimmed videos without the requirement of temporal annotations of action instances. Our UntrimmedNet couples two important components, the classification module and the selection module, to learn the action models and reason about the temporal duration of action instances, respectively. These two components are implemented with feed-forward networks, and UntrimmedNet is therefore an end-to-end…
Citation impact
- FWCI
- 20.70
- Percentile
- 100%
- References
- 80
Authors
4Topics & keywords
- Computer science
- Exploit
- Action recognition
- Action (physics)
- Artificial intelligence
- Pattern recognition (psychology)
- Machine learning
- Selection (genetic algorithm)