articleOct 1, 2023Closed access

FLatten Transformer: Vision Transformer using Focused Linear Attention

Tsinghua University

Indexed incrossref

Abstract

The quadratic computation complexity of self-attention has been a persistent challenge when applying Transformer models to vision tasks. Linear attention, on the other hand, offers a much more efficient alternative with its linear complexity by approximating the Softmax operation through carefully designed mapping functions. However, current linear attention approaches either suffer from significant performance degradation or introduce additional computation overhead from the mapping functions. In this paper, we propose a novel Focused Linear Attention module to achieve both high efficiency and expressiveness. Specifically, we first analyze the factors contributing to the performance degradation of linear…

Citation impact

275
total citations
FWCI
31.29
Percentile
100%
References
74
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Computation
  • Transformer
  • Quadratic equation
  • Computer engineering
  • Algorithm
  • Theoretical computer science
  • Mathematics
No related works found for this paper.

Funding