SurroundOcc: Multi-Camera 3D Occupancy Prediction for Autonomous Driving
National Engineering Research Center for Information Technology in Agriculture · Tsinghua University · +1 more institution
Abstract
3D scene understanding plays a vital role in vision-based autonomous driving. While most existing methods focus on 3D object detection, they have difficulty describing real-world objects of arbitrary shapes and infinite classes. Towards a more comprehensive perception of a 3D scene, in this paper, we propose a SurroundOcc method to predict the 3D occupancy with multi-camera images. We first extract multi-scale features for each image and adopt spatial 2D-3D attention to lift them to the 3D volume space. Then we apply 3D convolutions to progressively upsample the volume features and impose supervision on multiple levels. To obtain dense occupancy prediction, we design a pipeline to generate dense occupancy…
Citation impact
- FWCI
- 24.41
- Percentile
- 100%
- References
- 77
Authors
6- YWYi WeiCorresponding
National Engineering Research Center for Information Technology in Agriculture, Tsinghua University
- LZLinqing Zhao
Tianjin University
- WZWenzhao Zheng
Tsinghua University, National Engineering Research Center for Information Technology in Agriculture
- ZZZheng Zhu
- JZJie Zhou
Tsinghua University, National Engineering Research Center for Information Technology in Agriculture
Topics & keywords
- Occupancy
- Computer science
- Computer vision
- Artificial intelligence
- Ground truth
- Pipeline (software)
- Focus (optics)
- Expansive
- Sustainable cities and communities