Unsupervised Learning of Depth and Ego-Motion from Video

Zhou, Tinghui; Brown, Matthew; Snavely, Noah; Lowe, David

doi:10.1109/cvpr.2017.700

preprintJul 1, 2017Closed access

Unsupervised Learning of Depth and Ego-Motion from Video

TZTinghui Zhou MBMatthew Brown NSNoah Snavely DLDavid Lowe

University of California, Berkeley · Berkeley College · +1 more institution

Indexed incrossref

Abstract

We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. In common with recent work [10, 14, 16], we use an end-to-end learning approach with view synthesis as the supervisory signal. In contrast to the previous work, our method is completely unsupervised, requiring only monocular video sequences for training. Our method uses single-view depth and multiview pose networks, with a loss based on warping nearby views to the target using the computed depth and pose. The networks are thus coupled by the loss during training, but can be applied independently at test time. Empirical evaluation on the KITTI dataset demonstrates the…

Citation impact

2,796

total citations

FWCI: 93.80
Percentile: 100%
References: 79

Citations per year

Authors

4

Topics & keywords

Topics

Keywords

Artificial intelligence
Computer science
Monocular
Image warping
Computer vision
Ground truth
Dynamic time warping
Unsupervised learning

No related works found for this paper.