FLatten Transformer: Vision Transformer using Focused Linear Attention

Han, Dongchen; Pan, Xuran; Han, Yizeng; Song, Shiji; Huang, Gao

doi:10.1109/iccv51070.2023.00548

articleOct 1, 2023Closed access

FLatten Transformer: Vision Transformer using Focused Linear Attention

DHDongchen Han XPXuran Pan YHYizeng Han SSShiji Song GHGao Huang

Tsinghua University

Indexed incrossref

Abstract

The quadratic computation complexity of self-attention has been a persistent challenge when applying Transformer models to vision tasks. Linear attention, on the other hand, offers a much more efficient alternative with its linear complexity by approximating the Softmax operation through carefully designed mapping functions. However, current linear attention approaches either suffer from significant performance degradation or introduce additional computation overhead from the mapping functions. In this paper, we propose a novel Focused Linear Attention module to achieve both high efficiency and expressiveness. Specifically, we first analyze the factors contributing to the performance degradation of linear…

Citation impact

275

total citations

FWCI: 31.29
Percentile: 100%
References: 74

Citations per year

Authors

5

Topics & keywords

Topics

Keywords

Computer science
Computation
Transformer
Quadratic equation
Computer engineering
Algorithm
Theoretical computer science
Mathematics

No related works found for this paper.

Funding

NN
National Natural Science Foundation of China