articleOct 1, 2017Closed access

End-to-End Learning of Geometry and Context for Deep Stereo Regression

Indexed incrossref

Abstract

We propose a novel deep learning architecture for regressing disparity from a rectified pair of stereo images. We leverage knowledge of the problem's geometry to form a cost volume using deep feature representations. We learn to incorporate contextual information using 3-D convolutions over this volume. Disparity values are regressed from the cost volume using a proposed differentiable soft argmin operation, which allows us to train our method end-to-end to sub-pixel accuracy without any additional post-processing or regularization. We evaluate our method on the Scene Flow and KITTI datasets and on KITTI we set a new stateof-the-art benchmark, while being significantly faster than competing approaches.

Citation impact

1,510
total citations
FWCI
46.30
Percentile
100%
References
67
Citations per year

Authors

7

Topics & keywords

Keywords
  • Artificial intelligence
  • Leverage (statistics)
  • Computer science
  • Deep learning
  • Regularization (linguistics)
  • Differentiable function
  • End-to-end principle
  • Computer vision
UN Sustainable Development Goals
  • Sustainable cities and communities
No related works found for this paper.