UNETR: Transformers for 3D Medical Image Segmentation

Hatamizadeh, Ali; Tang, Yucheng; Nath, Vishwesh; Yang, Dong; Myronenko, Andriy; Landman, Bennett A.; Roth, Holger R.; Xu, Daguang

doi:10.1109/wacv51458.2022.00181

Article, 2022 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Jan 1, 2022, Closed access

Vanderbilt University

Indexed in Crossref

Abstract

Fully Convolutional Neural Networks (FCNNs) with contracting and expanding paths have been prominent in the majority of medical image segmentation applications over the past decade. In FCNNs, the encoder plays an integral role by learning both global and local features and contextual representations, which the decoder can use for semantic output prediction. Despite their success, the locality of convolutional layers in FCNNs limits their capability to learn long-range spatial dependencies. Inspired by the recent success of transformers for Natural Language Processing (NLP) in long-range sequence learning, we reformulate the task of volumetric (3D) medical image segmentation as a…

Citation impact

Total citations: 2,796

FWCI: 544.70
Percentile: 100%
References: 60

Authors: 8

Topics & keywords

Keywords

Computer science
Segmentation
Encoder
Transformer
Artificial intelligence
Convolutional neural network
Deep learning
Image segmentation

UN Sustainable Development Goals

Quality Education
