articleOct 1, 2017Closed access
End-to-End Learning of Geometry and Context for Deep Stereo Regression
Indexed incrossref
Abstract
We propose a novel deep learning architecture for regressing disparity from a rectified pair of stereo images. We leverage knowledge of the problem's geometry to form a cost volume using deep feature representations. We learn to incorporate contextual information using 3-D convolutions over this volume. Disparity values are regressed from the cost volume using a proposed differentiable soft argmin operation, which allows us to train our method end-to-end to sub-pixel accuracy without any additional post-processing or regularization. We evaluate our method on the Scene Flow and KITTI datasets and on KITTI we set a new stateof-the-art benchmark, while being significantly faster than competing approaches.
Citation impact
1,510
total citations
- FWCI
- 46.30
- Percentile
- 100%
- References
- 67
Citations per year
Authors
7Topics & keywords
Topics
Keywords
- Artificial intelligence
- Leverage (statistics)
- Computer science
- Deep learning
- Regularization (linguistics)
- Differentiable function
- End-to-end principle
- Computer vision
UN Sustainable Development Goals
- Sustainable cities and communities
No related works found for this paper.