Virtual Sparse Convolution for Multimodal 3D Object Detection
Xiamen University · Texas A&M University
Abstract
Recently, virtuall pseudo-point-based 3D object detection that seamlessly fuses RGB images and LiDAR data by depth completion has gained great attention. However, virtual points generated from an image are very dense, introducing a huge amount of redundant computation during detection. Meanwhile, noises brought by inaccurate depth completion significantly degrade detection precision. This paper proposes a fast yet effective backbone, termed Vir-ConvNet, based on a new operator VirConv (Virtual Sparse Convolution), for virtual-point-based 3D object detection. VirConv consists of two key designs: (1) StVD (Stochastic Voxel Discard) and (2) NRConv (Noise-Resistant Sub-manifold Convolution). StVD alleviates the…
Citation impact
- FWCI
- 22.22
- Percentile
- 100%
- References
- 56
Authors
5Topics & keywords
- Computer science
- Pipeline (software)
- Artificial intelligence
- Object detection
- Convolution (computer science)
- Computer vision
- Noise (video)
- Voxel