UNETR: Transformers for 3D Medical Image Segmentation

Vanderbilt University

Indexed incrossref

Abstract

Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have shown prominence for the majority of medical image segmentation applications since the past decade. In FCNNs, the encoder plays an integral role by learning both global and local features and contextual representations which can be utilized for semantic output prediction by the decoder. Despite their success, the locality of convolutional layers in FCNNs, limits the capability of learning long-range spatial dependencies. Inspired by the recent success of transformers for Natural Language Processing (NLP) in long-range sequence learning, we reformulate the task of volumetric (3D) medical image segmentation as a…

Citation impact

2,796
total citations
FWCI
544.70
Percentile
100%
References
60
Citations per year

Authors

8

Topics & keywords

Keywords
  • Computer science
  • Segmentation
  • Encoder
  • Transformer
  • Artificial intelligence
  • Convolutional neural network
  • Deep learning
  • Image segmentation
UN Sustainable Development Goals
  • Quality Education
No related works found for this paper.