Towards Robust Monocular Depth Estimation: Mixing Datasets for Zero-shot Cross-dataset Transfer

Intel (Germany) · Intel (United States)

Indexed indatacite

Abstract

The success of monocular depth estimation relies on large and diverse training sets. Due to the challenges associated with acquiring dense ground-truth depth across different environments at scale, a number of datasets with distinct characteristics and biases have emerged. We develop tools that enable mixing multiple datasets during training, even if their annotations are incompatible. In particular, we propose a robust training objective that is invariant to changes in depth range and scale, advocate the use of principled multi-objective learning to combine data from different sources, and highlight the importance of pretraining encoders on auxiliary tasks. Armed with these tools, we experiment with six…

Citation impact

1,150
total citations
FWCI
97.41
Percentile
100%
References
60
Citations per year

Authors

5

Topics & keywords

Keywords
  • Monocular
  • Computer science
  • Artificial intelligence
  • Ground truth
  • Generalization
  • Transfer of learning
  • Invariant (physics)
  • Machine learning
No related works found for this paper.