ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

Touvron, Hugo; Bojanowski, Piotr; Caron, Mathilde; Cord, Matthieu; El-Nouby, Alaaeldin; Grave, Édouard; Izacard, Gautier; Joulin, Armand; Synnaeve, Gabriel; Verbeek, Jakob; Jeǵou, Hervé

doi:10.1109/tpami.2022.3206148

articleIEEE Transactions on Pattern Analysis and Machine IntelligenceSep 12, 2022GREEN OA

ResMLP: Feedforward Networks for Image Classification With Data-Efficient Training

HTHugo Touvron PBPiotr Bojanowski MCMathilde Caron MCMatthieu Cord AEAlaaeldin El-Nouby

Sorbonne Université

PubMed

Indexed inarxivcrossrefpubmed

Abstract

We present ResMLP, an architecture built entirely upon multi-layer perceptrons for image classification. It is a simple residual network that alternates (i) a linear layer in which image patches interact, independently and identically across channels, and (ii) a two-layer feed-forward network in which channels interact independently per patch. When trained with a modern training strategy using heavy data-augmentation and optionally distillation, it attains surprisingly good accuracy/complexity trade-offs on ImageNet. We also train ResMLP models in a self-supervised setup, to further remove priors from employing a labelled dataset. Finally, by adapting our model to machine translation we achieve surprisingly…

Citation impact

756

total citations

FWCI: 91.70
Percentile: 100%
References: 126

Citations per year

Authors

11

Topics & keywords

Topics

Keywords

Computer science
Artificial intelligence
Feed forward
Perceptron
Residual
Layer (electronics)
Image (mathematics)
Contextual image classification

No related works found for this paper.