Recurrent Network Models for Human Dynamics
University of California, Berkeley
Abstract
We propose the Encoder-Recurrent-Decoder (ERD) model for recognition and prediction of human body pose in videos and motion capture. The ERD model is a recurrent neural network that incorporates nonlinear encoder and decoder networks before and after recurrent layers. We test instantiations of ERD architectures in the tasks of motion capture (mocap) generation, body pose labeling and body pose forecasting in videos. Our model handles mocap training data across multiple subjects and activity domains, and synthesizes novel motions while avoiding drifting for long periods of time. For human pose labeling, ERD outperforms a per frame body part detector by resolving left-right body part confusions. For video pose…
Citation impact
- FWCI
- 18.72
- Percentile
- 100%
- References
- 73
Authors
4Topics & keywords
- Computer science
- Artificial intelligence
- Recurrent neural network
- Autoencoder
- Encoder
- Feature (linguistics)
- Motion (physics)
- Optical flow
- Quality Education