articleIEEE Transactions on Geoscience and Remote SensingJan 1, 2022Closed access

Swin Transformer Embedding UNet for Remote Sensing Image Semantic Segmentation

Ministry of Education of the People's Republic of China · China University of Mining and Technology · +1 more institution

Indexed incrossref

Abstract

Global context information is essential for the semantic segmentation of remote sensing (RS) images. However, most existing methods rely on a convolutional neural network (CNN), which is challenging to directly obtain the global context due to the locality of the convolution operation. Inspired by the Swin transformer with powerful global modeling capabilities, we propose a novel semantic segmentation framework for RS images called ST-U-shaped network (UNet), which embeds the Swin transformer into the classical CNN-based UNet. ST-UNet constitutes a novel dual encoder structure of the Swin transformer and CNN in parallel. First, we propose a spatial interaction module (SIM), which encodes spatial information in…

No related works found for this paper.

Funding