articleMachine Intelligence ResearchSep 13, 2023HYBRID OA

DepthFormer: Exploiting Long-range Correlation and Local Information for Accurate Monocular Depth Estimation

Harbin Institute of Technology · University of Science and Technology of China

Indexed incrossref

Abstract

Abstract This paper aims to address the problem of supervised monocular depth estimation. We start with a meticulous pilot study to demonstrate that the long-range correlation is essential for accurate depth estimation. Moreover, the Transformer and convolution are good at long-range and close-range depth estimation, respectively. Therefore, we propose to adopt a parallel encoder architecture consisting of a Transformer branch and a convolution branch. The former can model global context with the effective attention mechanism and the latter aims to preserve the local information as the Transformer lacks the spatial inductive bias in modeling such contents. However, independent branches lead to a shortage of…

No related works found for this paper.