preprintJul 1, 2017Closed access

Unsupervised Learning of Depth and Ego-Motion from Video

University of California, Berkeley · Berkeley College · +1 more institution

Indexed incrossref

Abstract

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. In common with recent work [10, 14, 16], we use an end-to-end learning approach with view synthesis as the supervisory signal. In contrast to the previous work, our method is completely unsupervised, requiring only monocular video sequences for training. Our method uses single-view depth and multiview pose networks, with a loss based on warping nearby views to the target using the computed depth and pose. The networks are thus coupled by the loss during training, but can be applied independently at test time. Empirical evaluation on the KITTI dataset demonstrates the…

Citation impact

2,796
total citations
FWCI
93.80
Percentile
100%
References
79
Citations per year

Authors

4

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Monocular
  • Image warping
  • Computer vision
  • Ground truth
  • Dynamic time warping
  • Unsupervised learning
No related works found for this paper.