Vision Transformers for Single Image Dehazing
Zhejiang University of Science and Technology · Zhejiang University
Abstract
Image dehazing is a representative low-level vision task that estimates latent haze-free images from hazy images. In recent years, convolutional neural network-based methods have dominated image dehazing. However, vision Transformers, which has recently made a breakthrough in high-level vision tasks, has not brought new dimensions to image dehazing. We start with the popular Swin Transformer and find that several of its key designs are unsuitable for image dehazing. To this end, we propose DehazeFormer, which consists of various improvements, such as the modified normalization layer, activation function, and spatial information aggregation scheme. We train multiple variants of DehazeFormer on various datasets…
Citation impact
- FWCI
- 109.38
- Percentile
- 100%
- References
- 93
Authors
4Topics & keywords
- Computer science
- Artificial intelligence
- Normalization (sociology)
- Transformer
- Computer vision
- Convolutional neural network
- Pattern recognition (psychology)