Article · Computational Visual Media · Jul 28, 2023 · Diamond OA

Visual attention network

Tsinghua University · Nankai University

Indexed in: Crossref · DOAJ

Abstract

While originally designed for natural language processing tasks, the self-attention mechanism has recently taken various computer vision areas by storm. However, the 2D nature of images brings three challenges for applying self-attention in computer vision: (1) treating images as 1D sequences neglects their 2D structures; (2) the quadratic complexity is too expensive for high-resolution images; (3) it only captures spatial adaptability but ignores channel adaptability. In this paper, we propose a novel linear attention named large kernel attention (LKA) to enable self-adaptive and long-range correlations in self-attention while avoiding its shortcomings. Furthermore, we present a neural network based on LKA,…
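The quadratic-cost challenge in point (2) can be illustrated with a little arithmetic. The sketch below is not taken from the paper: the helper name `attention_pairs` and the 16-pixel patch size are assumptions chosen for illustration. It counts how many pairwise token interactions a standard self-attention layer would compute once an image is flattened into a 1D sequence of patches.

```python
def attention_pairs(height: int, width: int, patch: int = 16) -> int:
    """Count pairwise token interactions for an image flattened into patches.

    The patch size of 16 is an illustrative assumption, not a value
    stated in the abstract.
    """
    tokens = (height // patch) * (width // patch)  # sequence length after flattening
    return tokens * tokens                         # every token attends to every token


if __name__ == "__main__":
    for side in (224, 448, 896):
        print(f"{side}x{side}: {attention_pairs(side, side)} pairs")
```

Under these assumptions, doubling the side length of a square image quadruples the number of tokens and multiplies the number of pairwise interactions by 16, which is why the cost becomes prohibitive at high resolution.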

Citation impact

  • Total citations: 958
  • FWCI: 105.17
  • Percentile: 100%
  • References: 103

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Artificial intelligence
  • Segmentation
  • Convolutional neural network
  • Object detection
  • Benchmark (surveying)
  • Pattern recognition (psychology)
  • Machine learning
UN Sustainable Development Goals
  • Quality Education

Funding