Learning A Sparse Transformer Network for Effective Image Deraining
Nanjing University of Science and Technology · China Electronics Technology Group Corporation
Abstract
Transformers-based methods have achieved significant performance in image deraining as they can model the non-local information which is vital for high-quality image reconstruction. In this paper, we find that most existing Transformers usually use all similarities of the tokens from the query-key pairs for the feature aggregation. However, if the tokens from the query are different from those of the key, the self-attention values estimated from these tokens also involve in feature aggregation, which accordingly interferes with the clear image restoration. To overcome this problem, we propose an effective DeRaining network, Sparse Transformer (DRSformer) that can adaptively keep the most useful self-attention…
Citation impact
- FWCI
- 48.15
- Percentile
- 100%
- References
- 73
Authors
4Topics & keywords
- Computer science
- Transformer
- Artificial intelligence
- Pattern recognition (psychology)
- Feature (linguistics)
- Source code
- Data mining
- Machine learning