CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation With Transformers
Karlsruhe Institute of Technology · Hunan University
Abstract
Scene understanding based on image segmentation is a crucial component of autonomous vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting complementary features from the supplementary modality ( ${X}$ -modality). However, covering a wide variety of sensors with a modality-agnostic model remains an unresolved problem due to variations in sensor characteristics among different modalities. Unlike previous modality-specific methods, in this work, we propose a unified fusion framework, CMX, for RGB-X semantic segmentation. To generalize well across different modalities, that often include supplements as well as uncertainties, a unified cross-modal interaction is crucial for…
Citation impact
- FWCI
- 64.85
- Percentile
- 100%
- References
- 110
Authors
6Topics & keywords
- RGB color model
- Artificial intelligence
- Segmentation
- Modality (human–computer interaction)
- Computer science
- Computer vision
- Lidar
- Feature (linguistics)