BinsFormer: Revisiting Adaptive Bins for Monocular Depth Estimation
Harbin Institute of Technology · Australian National University
Abstract
Monocular depth estimation (MDE) is a fundamental task in computer vision and has drawn increasing attention. Recently, some methods reformulate it as a classification-regression task to boost the model performance, where continuous depth is estimated via a linear combination of predicted probability distributions and discrete bins. In this paper, we present a novel framework called BinsFormer, tailored for the classification-regression-based depth estimation. It mainly focuses on two crucial components in the specific task: 1) proper generation of adaptive bins; and 2) sufficient interaction between probability distribution and bins predictions. To specify, we employ a Transformer decoder to generate bins,…
Citation impact
- FWCI
- 24.95
- Percentile
- 100%
- References
- 79
Authors
4Topics & keywords
- Artificial intelligence
- Computer science
- Monocular
- Computer vision
- Image processing
- Algorithm
- Mathematics
- Image (mathematics)