articleOct 1, 2023Closed access

SwiftFormer: Efficient Additive Attention for Transformer-based Real-time Mobile Vision Applications

Mohamed bin Zayed University of Artificial Intelligence · University of California, Merced · +2 more institutions

Indexed incrossref

Abstract

Self-attention has become a defacto choice for capturing global context in various vision applications. However, its quadratic computational complexity with respect to image resolution limits its use in real-time applications, especially for deployment on resource-constrained mobile devices. Although hybrid approaches have been proposed to combine the advantages of convolutions and self-attention for a better speed-accuracy trade-off, the expensive matrix multiplication operations in self-attention remain a bottleneck. In this work, we introduce a novel efficient additive attention mechanism that effectively replaces the quadratic matrix multiplication operations with linear element-wise multiplications. Our…

No related works found for this paper.