ResViT: Residual Vision Transformers for Multimodal Medical Image Synthesis
Indexed inarxivcrossrefpubmed
Abstract
Generative adversarial models with convolutional neural network (CNN) backbones have recently been established as state-of-the-art in numerous medical image synthesis tasks. However, CNNs are designed to perform local processing with compact filters, and this inductive bias compromises learning of contextual features. Here, we propose a novel generative adversarial approach for medical image synthesis, ResViT, that leverages the contextual sensitivity of vision transformers along with the precision of convolution operators and realism of adversarial learning. ResViT's generator employs a central bottleneck comprising novel aggregated residual transformer (ART) blocks that synergistically combine residual…
Citation impact
536
total citations
- FWCI
- 49.42
- Percentile
- 100%
- References
- 136
Citations per year
Authors
3Topics & keywords
Topics
Keywords
- Computer vision
- Artificial intelligence
- Computer science
- Residual
- Transformer
- Image registration
- Medical imaging
- Machine vision
No related works found for this paper.