VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking
Chinese University of Hong Kong · University of Hong Kong · +2 more institutions
Abstract
3D object detectors usually rely on hand-crafted proxies, e.g., anchors or centers, and translate well-studied 2D frameworks to 3D. Thus, sparse voxel features need to be densified and processed by dense prediction heads, which inevitably costs extra computation. In this paper, we instead propose VoxelNext for fully sparse 3D object detection. Our core insight is to predict objects directly based on sparse voxel features, without relying on hand-crafted proxies. Our strong sparse convolutional network VoxelNeXt detects and tracks 3D objects through voxel features entirely. It is an elegant and efficient framework, with no need for sparse-to-dense conversion or NMS post-processing. Our method achieves a better…
Citation impact
- FWCI
- 44.50
- Percentile
- 100%
- References
- 80
Authors
5Topics & keywords
- Computer science
- Benchmark (surveying)
- Voxel
- Artificial intelligence
- Object detection
- Sparse approximation
- Computation
- Computer vision