articleOct 1, 2023Closed access
FLatten Transformer: Vision Transformer using Focused Linear Attention
Indexed incrossref
Abstract
The quadratic computation complexity of self-attention has been a persistent challenge when applying Transformer models to vision tasks. Linear attention, on the other hand, offers a much more efficient alternative with its linear complexity by approximating the Softmax operation through carefully designed mapping functions. However, current linear attention approaches either suffer from significant performance degradation or introduce additional computation overhead from the mapping functions. In this paper, we propose a novel Focused Linear Attention module to achieve both high efficiency and expressiveness. Specifically, we first analyze the factors contributing to the performance degradation of linear…
Citation impact
275
total citations
- FWCI
- 31.29
- Percentile
- 100%
- References
- 74
Citations per year
Authors
5Topics & keywords
Topics
Keywords
- Computer science
- Computation
- Transformer
- Quadratic equation
- Computer engineering
- Algorithm
- Theoretical computer science
- Mathematics
No related works found for this paper.