Abstract
MetaFormer, the abstracted architecture of Transformer, has been found to play a significant role in achieving competitive performance. In this paper, we further explore the capacity of MetaFormer, again, by migrating our focus away from the token mixer design: we introduce several baseline models under MetaFormer using the most basic or common mixers, and demonstrate their gratifying performance. We summarize our observations as follows: (1) MetaFormer ensures solid lower bound of performance. By merely adopting identity mapping as the token mixer, the MetaFormer model, termed IdentityFormer, achieves [Formula: see text]80% accuracy on ImageNet-1 K. (2) MetaFormer works well with arbitrary token mixers. When…
Citation impact
246
total citations
- FWCI
- 27.96
- Percentile
- 100%
- References
- 132
Citations per year
Authors
8Topics & keywords
Topics
Keywords
- Security token
- Computer science
- Separable space
- Suzuki-Kasami algorithm
- Artificial intelligence
- Algorithm
- Mathematics
- Computer network
No related works found for this paper.