Jie Xiao, Xueyang Fu, Aiping Liu, Feng Wu, Zheng-Jun Zha
- 2022.09.21: Add test code for full size images: test_full_size.py
Abstract: Existing deep learning based de-raining approaches have resorted to the convolutional architectures. However, the intrinsic limitations of convolution, including local receptive fields and independence of input content, hinder the model's ability to capture long-range and complicated rainy artifacts. To overcome these limitations, we propose an effective and efficient transformer-based architecture for the image de-raining. Firstly, we introduce general priors of vision tasks, i.e., locality and hierarchy, into the network architecture so that our model can achieve excellent de-raining performance without costly pre-training. Secondly, since the geometric appearance of rainy artifacts is complicated and of significant variance in space, it is essential for de-raining models to extract both local and non-local features. Therefore, we design the complementary window-based transformer and spatial transformer to enhance locality while capturing long-range dependencies. Besides, to compensate for the positional blindness of self-attention, we establish a separate representative space for modeling positional relationship, and design a new relative position enhanced multi-head self-attention. In this way, our model enjoys powerful abilities to capture dependencies from both content and position, so as to achieve better image content recovery while removing rainy artifacts. Experiments substantiate that our approach attains more appealing results than state-of-the-art methods quantitatively and qualitatively.
To evaluate the image with arbitrary size, we first split the image to overlapped
To test the pre-trained IDT model on full size images:
python test_full_size.py --arch IDT --save_path your_save_path --data_path your_input_path --weights pretrained_weights
For research convinience, we release both derained patches as well as full-size images.
- Derain results Google Drive
- Rain200H/Rain200L/DDN/DID/SPA: evaluaterain.m
- Raindrop: evaluate_raindrop.py
Dataset | Rain200H | Rain200L | DDN-Data | DID-Data | SPA-Data | AGAN-Data |
---|---|---|---|---|---|---|
PSNR | 32.10 | 40.74 | 33.80 | 34.85 | 47.34 | 31.63 |
SSIM | 0.9343 | 0.9884 | 0.9407 | 0.9401 | 0.9929 | 0.9360 |
- Derain Google Drive
If you use IDT, please consider citing:
@article{xiao2022image,
title={Image De-raining Transformer},
author={Xiao, Jie and Fu, Xueyang and Liu, Aiping and Wu, Feng and Zha, Zheng-Jun},
journal={IEEE Transactions on Pattern Analysis and Machine Intelligence},
year={2022},
publisher={IEEE}
}
This code is based on the Uformer. The evaluation code borrows from Restormer and AttentGAN.
Please contact us if there is any question([email protected]).