articleJun 1, 2023Closed access

Learned Image Compression with Mixed Transformer-CNN Architectures

Waseda University · Shanghai Jiao Tong University

Indexed incrossref

Abstract

Learned image compression (LIC) methods have exhibited promising progress and superior rate-distortion performance compared with classical image compression standards. Most existing LIC methods are Convolutional Neural Networks-based (CNN-based) or Transformer-based, which have different advantages. Exploiting both advantages is a point worth exploring, which has two challenges: 1) how to effectively fuse the two methods? 2) how to achieve higher performance with a suitable complexity? In this paper, we propose an efficient parallel Transformer-CNN Mixture (TCM) block with a controllable complexity to incorporate the local modeling ability of CNN and the non-local modeling ability of transformers to improve…

Citation impact

336
total citations
FWCI
38.19
Percentile
100%
References
44
Citations per year

Authors

3

Topics & keywords

Keywords
  • Computer science
  • Convolutional neural network
  • Transformer
  • Image compression
  • Entropy (arrow of time)
  • Artificial intelligence
  • Computer engineering
  • Algorithm
No related works found for this paper.

Funding