Fusion-Mamba for Cross-Modality Object Detection
Beihang University · East China Normal University · +2 more institutions
Abstract
Cross-modality object detection aims to fuse complementary information from different modalities to improve model performance, which achieves a wider range of applications. However, traditional cross-modality fusion methods, based on CNN or Transformer, inadequately address the issue of pseudo-target information, which causes model attention dispersion to degrade object detection performance. In this paper, we investigate a novel cross-modality fusion approach by associating cross-modal features in a hidden state space based on an improved Mamba with a gating attention mechanism. We propose the Fusion-Mamba Block(FMB), designed to map cross-modal features into a hidden state space for interaction, thereby…
Citation impact
- FWCI
- 49.21
- Percentile
- 100%
- References
- 78
Authors
7- WDWenhao DongCorresponding
Beihang University
- HZHaodong Zhu
Beihang University
- SLShaohui Lin
East China Normal University
- XLXiaoyan Luo
Beihang University
- YSYunhang Shen
Tencent (China)
Topics & keywords
- Computer science
- Modality (human–computer interaction)
- Object (grammar)
- Artificial intelligence