articleOct 1, 2023Closed access

EfficientViT: Lightweight Multi-Scale Attention for High-Resolution Dense Prediction

Moscow Institute of Thermal Technology · Zhejiang University · +2 more institutions

Indexed incrossref

Abstract

High-resolution dense prediction enables many appealing real-world applications, such as computational photography, autonomous driving, etc. However, the vast computational cost makes deploying state-of-the-art high-resolution dense prediction models on hardware devices difficult. This work presents EfficientViT, a new family of high-resolution vision models with novel lightweight multi-scale attention. Unlike prior high-resolution dense prediction models that rely on heavy self-attention, hardware-inefficient large-kernel convolution, or complicated topology structure to obtain good performances, our lightweight multi-scale attention achieves a global receptive field and multi-scale learning (two critical…

Citation impact

261
total citations
FWCI
29.71
Percentile
100%
References
74
Citations per year

Authors

5

Topics & keywords

Keywords
  • Computer science
  • Speedup
  • Kernel (algebra)
  • Field-programmable gate array
  • Cloud computing
  • High resolution
  • Supercomputer
  • Computational photography
UN Sustainable Development Goals
  • Sustainable cities and communities
No related works found for this paper.