MetaFormer Baselines for Vision

National University of Singapore

PubMed
Indexed incrossrefpubmed

Abstract

MetaFormer, the abstracted architecture of Transformer, has been found to play a significant role in achieving competitive performance. In this paper, we further explore the capacity of MetaFormer, again, by migrating our focus away from the token mixer design: we introduce several baseline models under MetaFormer using the most basic or common mixers, and demonstrate their gratifying performance. We summarize our observations as follows: (1) MetaFormer ensures solid lower bound of performance. By merely adopting identity mapping as the token mixer, the MetaFormer model, termed IdentityFormer, achieves [Formula: see text]80% accuracy on ImageNet-1 K. (2) MetaFormer works well with arbitrary token mixers. When…

Citation impact

246
total citations
FWCI
27.96
Percentile
100%
References
132
Citations per year

Authors

8

Topics & keywords

Keywords
  • Security token
  • Computer science
  • Separable space
  • Suzuki-Kasami algorithm
  • Artificial intelligence
  • Algorithm
  • Mathematics
  • Computer network
No related works found for this paper.

Funding