CMX: Cross-Modal Fusion for RGB-X Semantic Segmentation With Transformers

Karlsruhe Institute of Technology · Hunan University

Indexed incrossref

Abstract

Scene understanding based on image segmentation is a crucial component of autonomous vehicles. Pixel-wise semantic segmentation of RGB images can be advanced by exploiting complementary features from the supplementary modality ( ${X}$ -modality). However, covering a wide variety of sensors with a modality-agnostic model remains an unresolved problem due to variations in sensor characteristics among different modalities. Unlike previous modality-specific methods, in this work, we propose a unified fusion framework, CMX, for RGB-X semantic segmentation. To generalize well across different modalities, that often include supplements as well as uncertainties, a unified cross-modal interaction is crucial for…

Citation impact

577
total citations
FWCI
64.85
Percentile
100%
References
110
Citations per year

Authors

6

Topics & keywords

Keywords
  • RGB color model
  • Artificial intelligence
  • Segmentation
  • Modality (human–computer interaction)
  • Computer science
  • Computer vision
  • Lidar
  • Feature (linguistics)
No related works found for this paper.

Funding