BEVFormer v2: Adapting Modern Image Backbones to Bird's-Eye-View Recognition via Perspective Supervision
Tsinghua University · Centre for Artificial Intelligence and Robotics · +5 more institutions
Abstract
We present a novel bird's-eye-view (BEV) detector with perspective supervision, which converges faster and bet-suits modern image backbones. Existing state-of-the-art BEV detectors are often tied to certain depth pretrained backbones like Vo Vn et, hindering the synergy between booming image backbones and BEV detectors. To address this limitation, we prioritize easing the optimization of BEV detectors by introducing perspective view supervision. To this end, we propose a two-stage BEV detector; where proposals from the perspective head are fed into the bird’ s-eye-view head for final predictions. To evaluate the effectiveness of our model, we conduct extensive ablation studies focusing on the form of…
Citation impact
- FWCI
- 32.28
- Percentile
- 100%
- References
- 62
Authors
12Topics & keywords
- Perspective (graphical)
- Detector
- Computer science
- Code (set theory)
- Artificial intelligence
- Image (mathematics)
- Computer vision
- Telecommunications