Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

Zhang, Cheng; Jiang, Wanshou; Zhang, Yuan; Wang, Wei; Zhao, Qing; Wang, Chenjie

doi:10.1109/tgrs.2022.3144894

articleIEEE Transactions on Geoscience and Remote SensingJan 1, 2022Closed access

Transformer and CNN Hybrid Deep Neural Network for Semantic Segmentation of Very-High-Resolution Remote Sensing Imagery

CZCheng Zhang WJWanshou Jiang YZYuan Zhang WWWei Wang QZQing Zhao

Wuhan University · State Key Laboratory of Information Engineering in Surveying Mapping and Remote Sensing

Indexed incrossref

Abstract

This article presents a transformer and convolutional neural network (CNN) hybrid deep neural network for semantic segmentation of very high resolution (VHR) remote sensing imagery. The model follows an encoder–decoder structure. The encoder module uses a new universal backbone Swin transformer to extract features to achieve better long-range spatial dependencies modeling. The decoder module draws on some effective blocks and successful strategies of CNN-based models in remote sensing image segmentation. In the middle of the framework, an atrous spatial pyramid pooling block based on depthwise separable convolution (SASPP) is applied to obtain a multiscale context. A U-shaped decoder is used to gradually…

Citation impact

310

total citations

FWCI: 30.34
Percentile: 100%
References: 54

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Segmentation
Convolutional neural network
Encoder
Pyramid (geometry)
Pattern recognition (psychology)
Deep learning

No related works found for this paper.