FusionMamba: dynamic feature enhancement for multimodal image fusion with Mamba
Macao Polytechnic University · Hong Kong Polytechnic University
Abstract
Abstract Multimodal image fusion aims to integrate information from different imaging techniques to produce a comprehensive, detail-rich single image for downstream vision tasks. Existing methods based on local convolutional neural networks (CNNs) struggle to capture global features efficiently, while Transformer-based models are computationally expensive, although they excel at global modeling. Mamba addresses these limitations by leveraging selective structured state space models (S4) to effectively handle long-range dependencies while maintaining linear complexity. In this paper, we propose FusionMamba, a novel dynamic feature enhancement framework that aims to overcome the challenges faced by CNNs and…
Citation impact
- FWCI
- 52.55
- Percentile
- 100%
- References
- 60
Authors
5Topics & keywords
- Feature (linguistics)
- Image (mathematics)
- Computer science
- Image fusion
- Artificial intelligence
- Computer vision
- Fusion
- Pattern recognition (psychology)
- Industry, innovation and infrastructure