CLOCs: Camera-LiDAR Object Candidates Fusion for 3D Object Detection
Michigan United · Michigan State University
Abstract
There have been significant advances in neural networks for both 3D object detection using LiDAR and 2D object detection using video. However, it has been surprisingly difficult to train networks to effectively use both modalities in a way that demonstrates gain over single-modality networks. In this paper, we propose a novel Camera-LiDAR Object Candidates (CLOCs) fusion network. CLOCs fusion provides a low-complexity multi-modal fusion framework that significantly improves the performance of single-modality detectors. CLOCs operates on the combined output candidates before Non-Maximum Suppression (NMS) of any 2D and any 3D detector, and is trained to leverage their geometric and semantic consistencies to…
Citation impact
- FWCI
- 21.58
- Percentile
- 100%
- References
- 40
Authors
3Topics & keywords
- Lidar
- Object detection
- Leverage (statistics)
- Artificial intelligence
- Computer science
- Computer vision
- Benchmark (surveying)
- Modality (human–computer interaction)