DepthMaster: Taming Diffusion Models for Monocular Depth Estimation
University of Science and Technology of China · China Mobile (China)
Abstract
Monocular depth estimation within the diffusion-denoising paradigm demonstrates impressive generalization ability but suffers from low inference speed. Recent methods adopt a single-step deterministic paradigm to improve inference efficiency while maintaining comparable performance. However, they overlook the gap between generative and discriminative features, leading to suboptimal results. In this work, we propose DepthMaster, a single-step diffusion model designed to adapt generative features for the discriminative depth estimation task. First, to mitigate overfitting to texture details introduced by generative features, we propose a Feature Alignment module, which incorporates high-quality semantic features…
Citation impact
- FWCI
- 27.31
- Percentile
- 98%
- References
- 0
Authors
8- ZSZiyang SongCorresponding
University of Science and Technology of China
- ZWZerong Wang
China Mobile (China)
- BLBo Li
China Mobile (China)
- HZHao Zhang
China Mobile (China)
- RZRuijie Zhu
University of Science and Technology of China
Topics & keywords
- Monocular
- Diffusion
- Estimation
- Geology
- Computer science
- Computer vision
- Economics
- Physics