ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Sorbonne Université

PubMed
Indexed inarxivcrossrefpubmed

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly…

Citation impact

756
total citations
FWCI
91.70
Percentile
100%
References
126
Citations per year

Authors

11

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Feed forward
  • Perceptron
  • Residual
  • Layer (electronics)
  • Image (mathematics)
  • Contextual image classification
No related works found for this paper.