Cross-View Action Modeling, Learning, and Recognition
Northwestern University · University of California, Los Angeles
Abstract
Existing methods on video-based action recognition are generally view-dependent, i.e., performing recognition from the same views seen in the training data. We present a novel multiview spatio-temporal and-or graph (MST-AOG) representation for cross-view action recognition, i.e., the recognition is performed on the video from an unknown and unseen view. As a compositional model, MST-AOG compactly represents the hierarchical combinatorial structures of cross-view actions by explicitly modeling the geometry, appearance and motion variations. This paper proposes effective methods to learn the structure and parameters of MST-AOG. The inference based on MST-AOG enables action recognition from novel views. The…
Citation impact
- FWCI
- 14.58
- Percentile
- 100%
- References
- 33
Authors
5Topics & keywords
- Computer science
- Action recognition
- Artificial intelligence
- Robustness (evolution)
- Inference
- Representation (politics)
- Graph
- Action (physics)