Samba: Semantic segmentation of remotely sensed images with state space model
University of Liverpool · Xi’an Jiaotong-Liverpool University · +3 more institutions
Abstract
High-resolution remotely sensed images pose challenges to traditional semantic segmentation networks, such as Convolutional Neural Networks (CNNs) and Vision Transformers (ViTs). CNN-based methods struggle to handle high-resolution images due to their limited receptive field, while ViT-based methods, despite having a global receptive field, face challenges when processing long sequences. Inspired by the Mamba network, which is based on a state space model (SSM) to efficiently capture global semantic information, we propose a semantic segmentation framework for high-resolution remotely sensed imagery, named Samba. Samba utilizes an encoder-decoder architecture, with multiple Samba blocks serving as the encoder…
Citation impact
- FWCI
- 31.52
- Percentile
- 100%
- References
- 59
Authors
7- QZQinfeng ZhuCorresponding
University of Liverpool, Xi’an Jiaotong-Liverpool University
- YCYuanzhi Cai
Commonwealth Scientific and Industrial Research Organisation, Mineral Resources, Australian Resources Research Centre
- YFYuan Fang
Xi’an Jiaotong-Liverpool University
- YYYihan Yang
Xi’an Jiaotong-Liverpool University
- CCCheng Chen
Xi’an Jiaotong-Liverpool University
Topics & keywords
- Segmentation
- Remote sensing
- Artificial intelligence
- Computer science
- Space (punctuation)
- State (computer science)
- Computer vision
- Cartography