articleIEEE Transactions on Image ProcessingJan 1, 2024Closed access

BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation

Harbin Institute of Technology · Australian National University

PubMed
Indexed incrossrefpubmed

Abstract

Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn increasing attention. Recently, some methods reformulate it as a classification-regression task to boost the model performance, where continuous depth is estimated via a linear combination of predicted probability distributions and discrete bins. In this paper, we present a novel framework called BinsFormer, tailored for the classification-regression-based depth estimation. It mainly focuses on two crucial components in the specific task: 1) proper generation of adaptive bins; and 2) sufficient interaction between probability distribution and bins predictions. To specify, we employ a Transformer decoder to generate bins,…

Citation impact

118
total citations
FWCI
24.95
Percentile
100%
References
79
Citations per year

Authors

4

Topics & keywords

Keywords
  • Artificial intelligence
  • Computer science
  • Monocular
  • Computer vision
  • Image processing
  • Algorithm
  • Mathematics
  • Image (mathematics)
No related works found for this paper.

Funding