articleVisual IntelligenceDec 31, 2024DIAMOND OA

FusionMamba: dynamic feature enhancement for multimodal image fusion with Mamba

Macao Polytechnic University · Hong Kong Polytechnic University

Indexed incrossrefdoaj

Abstract

Abstract Multimodal image fusion aims to integrate information from different imaging techniques to produce a comprehensive, detail-rich single image for downstream vision tasks. Existing methods based on local convolutional neural networks (CNNs) struggle to capture global features efficiently, while Transformer-based models are computationally expensive, although they excel at global modeling. Mamba addresses these limitations by leveraging selective structured state space models (S4) to effectively handle long-range dependencies while maintaining linear complexity. In this paper, we propose FusionMamba, a novel dynamic feature enhancement framework that aims to overcome the challenges faced by CNNs and…

Citation impact

179
total citations
FWCI
52.55
Percentile
100%
References
60
Citations per year

Authors

5

Topics & keywords

Keywords
  • Feature (linguistics)
  • Image (mathematics)
  • Computer science
  • Image fusion
  • Artificial intelligence
  • Computer vision
  • Fusion
  • Pattern recognition (psychology)
UN Sustainable Development Goals
  • Industry, innovation and infrastructure
No related works found for this paper.

Funding