Mixed Transformer U-Net for Medical Image Segmentation
Zhejiang University of Science and Technology · Ritsumeikan University · +3 more institutions
Abstract
Though U-Net has achieved tremendous success in medical image segmentation tasks, it lacks the ability to explicitly model long-range dependencies. Therefore, Vision Transformers have emerged as alternative segmentation structures recently, for their innate ability of capturing long-range correlations through Self-Attention (SA). However, Transformers usually rely on large-scale pre-training and have high computational complexity. Furthermore, SA can only model self-affinities within a single sample, ignoring the potential correlations of the overall dataset. To address these problems, we propose a novel Transformer module named Mixed Transformer Module (MTM) for simultaneous inter- and intra- affinities…
Citation impact
- FWCI
- 19.36
- Percentile
- 100%
- References
- 36
Authors
7- HWHongyi WangCorresponding
Zhejiang University of Science and Technology
- SXShiao Xie
Zhejiang University of Science and Technology
- LLLanfen Lin
Zhejiang University of Science and Technology
- YIYutaro Iwamoto
Ritsumeikan University
- XHXian‐Hua Han
Yamaguchi University, Artificial Intelligence in Medicine (Canada)
Topics & keywords
- Segmentation
- Transformer
- Computer science
- Image segmentation
- Artificial intelligence
- Gaussian
- Pattern recognition (psychology)
- Machine learning