FusionMamba: dynamic feature enhancement for multimodal image fusion with Mamba

Xie, Xinyu; Cui, Yawen; Tan, Tao; Zheng, Xubin; Yu, Zitong

doi:10.1007/s44267-024-00072-9

articleVisual IntelligenceDec 31, 2024DIAMOND OA

FusionMamba: dynamic feature enhancement for multimodal image fusion with Mamba

XXXinyu Xie YCYawen Cui TTTao Tan XZXubin Zheng ZYZitong Yu

Macao Polytechnic University · Hong Kong Polytechnic University

Indexed incrossrefdoaj

Abstract

Abstract Multimodal image fusion aims to integrate information from different imaging techniques to produce a comprehensive, detail-rich single image for downstream vision tasks. Existing methods based on local convolutional neural networks (CNNs) struggle to capture global features efficiently, while Transformer-based models are computationally expensive, although they excel at global modeling. Mamba addresses these limitations by leveraging selective structured state space models (S4) to effectively handle long-range dependencies while maintaining linear complexity. In this paper, we propose FusionMamba, a novel dynamic feature enhancement framework that aims to overcome the challenges faced by CNNs and…

Citation impact

179

total citations

FWCI: 52.55
Percentile: 100%
References: 60

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Feature (linguistics)
Image (mathematics)
Computer science
Image fusion
Artificial intelligence
Computer vision
Fusion
Pattern recognition (psychology)

UN Sustainable Development Goals

Industry, innovation and infrastructure

No related works found for this paper.