A Novel Transformer Based Semantic Segmentation Scheme for Fine-Resolution Remote Sensing Images

Wang, Libo; Li, Rui; Duan, Chenxi; Zhang, Ce; Meng, Xiaoliang; Fang, Shenghui

doi:10.1109/lgrs.2022.3143368

Article · IEEE Geoscience and Remote Sensing Letters · Jan 1, 2022 · Green OA

Wuhan University · State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing · +2 more institutions

Indexed in: arXiv, Crossref

Abstract

The fully convolutional network (FCN) with an encoder-decoder architecture has been the standard paradigm for semantic segmentation. The encoder-decoder architecture utilizes an encoder to capture multilevel feature maps, which are incorporated into the final prediction by a decoder. As the context is crucial for precise segmentation, tremendous effort has been made to extract such information in an intelligent fashion, including employing dilated/atrous convolutions or inserting attention modules. However, these endeavors are all based on the FCN architecture with ResNet or other backbones, which cannot fully exploit the context from the theoretical concept. By contrast, we introduce the Swin Transformer as…
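The abstract mentions dilated (atrous) convolutions as one way FCN-based models enlarge their context. The following is a minimal one-dimensional sketch of that idea in plain Python, not the paper's method or code; the function names `dilated_conv1d` and `receptive_field` are illustrative assumptions. It shows that spacing the kernel taps `dilation` samples apart widens the input span each output sees without adding weights.

```python
def receptive_field(kernel_size, dilation):
    """Input span covered by one output of a single dilated convolution."""
    return (kernel_size - 1) * dilation + 1


def dilated_conv1d(signal, kernel, dilation=1):
    """Valid-mode 1D cross-correlation with `dilation` samples between taps.

    Hypothetical helper for illustration only; real segmentation networks
    apply the same idea in 2D over feature maps.
    """
    span = receptive_field(len(kernel), dilation)
    out = []
    for start in range(len(signal) - span + 1):
        acc = 0.0
        for k, weight in enumerate(kernel):
            # Each tap is read `dilation` positions after the previous one.
            acc += weight * signal[start + k * dilation]
        out.append(acc)
    return out


signal = [1, 2, 3, 4, 5, 6, 7]
kernel = [1, 0, -1]

dense = dilated_conv1d(signal, kernel, dilation=1)   # each output spans 3 inputs
atrous = dilated_conv1d(signal, kernel, dilation=2)  # each output spans 5 inputs
```

With the same three weights, the dilated call compares samples four positions apart instead of two, which is the sense in which dilation trades output length for a wider context window.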

Citation impact

Total citations: 384

FWCI: 40.45
Percentile: 100%
References: 41

Authors: 6

Topics & keywords

Keywords

Computer science
Encoder
Segmentation
Artificial intelligence
Transformer
Decoding methods
Computer vision
Image segmentation

Funding

National Natural Science Foundation of China
Award: 41971352