HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

Iran University of Science and Technology · RWTH Aachen University · +4 more institutions

Indexed incrossref

Abstract

Convolutional neural networks (CNNs) have been the consensus for medical image segmentation tasks. However, they suffer from the limitation in modeling long-range dependencies and spatial correlations due to the nature of convolution operation. Although transformers were first developed to address this issue, they fail to capture low-level features. In contrast, it is demonstrated that both local and global features are crucial for dense prediction, such as segmenting in challenging contexts. In this paper, we propose HiFormer, a novel method that efficiently bridges a CNN and a transformer for medical image segmentation. Specifically, we design two multi-scale feature representations using the seminal Swin…

Citation impact

467
total citations
FWCI
27.41
Percentile
100%
References
65
Citations per year

Authors

7

Topics & keywords

Keywords
  • Computer science
  • Encoder
  • Segmentation
  • Artificial intelligence
  • Transformer
  • Convolutional neural network
  • Image segmentation
  • Pattern recognition (psychology)
No related works found for this paper.