Coarse-to-Fine Volumetric Prediction for Single-Image 3D Human Pose
California University of Pennsylvania · Toronto Metropolitan University
Abstract
This paper addresses the challenge of 3D human pose estimation from a single color image. Despite the general success of the end-to-end learning paradigm, top performing approaches employ a two-step solution consisting of a Convolutional Network (ConvNet) for 2D joint localization and a subsequent optimization step to recover 3D pose. In this paper, we identify the representation of 3D pose as a critical issue with current ConvNet approaches and make two important contributions towards validating the value of end-to-end learning for this task. First, we propose a fine discretization of the 3D space around the subject and train a ConvNet to predict per voxel likelihoods for each joint. This creates a natural…
Citation impact
- FWCI
- 35.12
- Percentile
- 100%
- References
- 61
Authors
4Topics & keywords
- Computer science
- Artificial intelligence
- Pose
- Representation (politics)
- Voxel
- Convolutional neural network
- Feature learning
- Pattern recognition (psychology)