Mixed Transformer U-Net for Medical Image Segmentation

Wang, Hongyi; Xie, Shiao; Lin, Lanfen; Iwamoto, Yutaro; Han, Xian‐Hua; Chen, Yen‐Wei; Tong, Ruofeng

doi:10.1109/icassp43922.2022.9746172

articleICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)Apr 27, 2022Closed access

Mixed Transformer U-Net for Medical Image Segmentation

HWHongyi Wang SXShiao Xie LLLanfen Lin YIYutaro Iwamoto XHXian‐Hua Han

Zhejiang University of Science and Technology · Ritsumeikan University · +3 more institutions

Indexed incrossref

Abstract

Though U-Net has achieved tremendous success in medical image segmentation tasks, it lacks the ability to explicitly model long-range dependencies. Therefore, Vision Transformers have emerged as alternative segmentation structures recently, for their innate ability of capturing long-range correlations through Self-Attention (SA). However, Transformers usually rely on large-scale pre-training and have high computational complexity. Furthermore, SA can only model self-affinities within a single sample, ignoring the potential correlations of the overall dataset. To address these problems, we propose a novel Transformer module named Mixed Transformer Module (MTM) for simultaneous inter- and intra- affinities…

Citation impact

356

total citations

FWCI: 19.36
Percentile: 100%
References: 36

Citations per year

Authors

7

Topics & keywords

Topics

Keywords

Segmentation
Transformer
Computer science
Image segmentation
Artificial intelligence
Gaussian
Pattern recognition (psychology)
Machine learning

No related works found for this paper.

Funding

NS
Natural Science Foundation of Zhejiang Province