ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training
Indexed inarxivcrossrefpubmed
Abstract
We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly…
Citation impact
756
total citations
- FWCI
- 91.70
- Percentile
- 100%
- References
- 126
Citations per year
Authors
11Topics & keywords
Topics
Keywords
- Computer science
- Artificial intelligence
- Feed forward
- Perceptron
- Residual
- Layer (electronics)
- Image (mathematics)
- Contextual image classification
No related works found for this paper.