Rethinking Mobile Block for Efficient Attention-based Models
Zhejiang University · Tencent (China) · +2 more institutions
Abstract
This paper focuses on developing modern, efficient, lightweight models for dense predictions while trading off parameters, FLOPs, and performance. Inverted Residual Block (IRB) serves as the infrastructure for lightweight CNNs, but no counterpart has been recognized by attention-based studies. This work rethinks lightweight infrastructure from efficient IRB and effective components of Transformer from a unified perspective, extending CNN-based IRB to attention-based models and abstracting a one-residual Meta Mobile Block (MMB) for lightweight model design. Following simple but effective design criterion, we deduce a modern Inverted Residual Mobile Block (iRMB) and build a ResNetlike Efficient MOdel (EMO) with…
Citation impact
- FWCI
- 30.02
- Percentile
- 100%
- References
- 113
Authors
10Topics & keywords
- Computer science
- FLOPS
- Residual
- Block (permutation group theory)
- Computer engineering
- Perspective (graphical)
- Transformer
- Mobile device
- Industry, innovation and infrastructure