Activating More Pixels in Image Super-Resolution Transformer
City University of Macau · University of Macau · +5 more institutions
Abstract
Transformer-based methods have shown impressive performance in low-level vision tasks, such as image super-resolution. However, we find that these networks can only utilize a limited spatial range of input information through attribution analysis. This implies that the potential of Transformer is still not fully exploited in existing networks. In order to activate more input pixels for better reconstruction, we propose a novel Hybrid Attention Transformer (HAT). It combines both channel attention and window-based self-attention schemes, thus making use of their complementary advantages of being able to utilize global statistics and strong local fitting capability. Moreover, to better aggregate the cross-window…
Citation impact
- FWCI
- 110.24
- Percentile
- 100%
- References
- 88
Authors
5- XCXiangyu ChenCorresponding
City University of Macau, University of Macau, Shenzhen Institutes of Advanced Technology, Shanghai Artificial Intelligence Laboratory, Beijing Academy of Artificial Intelligence, Chinese Academy of Sciences
- XWXintao Wang
Tencent (China)
- JZJiantao Zhou
City University of Macau, University of Macau
- YQYu Qiao
Beijing Academy of Artificial Intelligence, Shanghai Artificial Intelligence Laboratory, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences
- CDChao Dong
Chinese Academy of Sciences, Shenzhen Institutes of Advanced Technology, Shanghai Artificial Intelligence Laboratory, Beijing Academy of Artificial Intelligence
Topics & keywords
- Computer science
- Transformer
- Exploit
- Pixel
- Artificial intelligence
- Image resolution
- Pattern recognition (psychology)
- Computer vision