BEVDepth: Acquisition of Reliable Depth for Multi-View 3D Object Detection
Institute of Computing Technology · University of Chinese Academy of Sciences · +4 more institutions
Abstract
In this research, we propose a new 3D object detector with a trustworthy depth estimation, dubbed BEVDepth, for camera-based Bird's-Eye-View~(BEV) 3D object detection. Our work is based on a key observation -- depth estimation in recent approaches is surprisingly inadequate given the fact that depth is essential to camera 3D detection. Our BEVDepth resolves this by leveraging explicit depth supervision. A camera-awareness depth estimation module is also introduced to facilitate the depth predicting capability. Besides, we design a novel Depth Refinement Module to counter the side effects carried by imprecise feature unprojection. Aided by customized Efficient Voxel Pooling and multi-frame mechanism, BEVDepth…
Citation impact
- FWCI
- 38.31
- Percentile
- 100%
- References
- 73
Authors
8- YLYinhao LiCorresponding
Institute of Computing Technology, University of Chinese Academy of Sciences
- ZGZheng Ge
Vi Technology (United States), Megvii (China)
- GYGuanyi Yu
Vi Technology (United States), Megvii (China)
- JYJinrong Yang
Huazhong University of Science and Technology
- ZWZengran Wang
Vi Technology (United States), Megvii (China)
Topics & keywords
- Computer science
- Artificial intelligence
- Computer vision
- Pooling
- Object detection
- Frame (networking)
- Object (grammar)
- Detector