Learned Image Compression with Mixed Transformer-CNN Architectures
Waseda University · Shanghai Jiao Tong University
Abstract
Learned image compression (LIC) methods have exhibited promising progress and superior rate-distortion performance compared with classical image compression standards. Most existing LIC methods are Convolutional Neural Networks-based (CNN-based) or Transformer-based, which have different advantages. Exploiting both advantages is a point worth exploring, which has two challenges: 1) how to effectively fuse the two methods? 2) how to achieve higher performance with a suitable complexity? In this paper, we propose an efficient parallel Transformer-CNN Mixture (TCM) block with a controllable complexity to incorporate the local modeling ability of CNN and the non-local modeling ability of transformers to improve…
Citation impact
- FWCI
- 38.19
- Percentile
- 100%
- References
- 44
Authors
3Topics & keywords
- Computer science
- Convolutional neural network
- Transformer
- Image compression
- Entropy (arrow of time)
- Artificial intelligence
- Computer engineering
- Algorithm