CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

📰 News

We are excited to announce that our paper on Efficient Video Restoration, titled Learning Truncated Causal History Model for Video Restoration (GitHub Repository), has been accepted to NeurIPS 2024! 🔥

The official PyTorch implementation of the paper

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration
Amirhosein Ghasemabadi, Muhammad Kamran Janjua, Mohammad Salameh, Chunhua Zhou, Fengyu Sun, Di Niu
Accepted at Transactions on Machine Learning Research (TMLR), 2024.

Installation

This implementation is based on BasicSR which is an open-source toolbox for image/video restoration tasks, NAFNet, Restormer and Multi Output Deblur

python 3.9.5
pytorch 1.11.0
cuda 11.3

pip install -r requirements.txt
python setup.py develop --no_cuda_ext

Quick Start

We have provided demo-denoising.ipynb to show how to load images from the validation dataset and use the model to restore images.

CascadedGaze implementation

The implementation of our proposed CascadedGaze Net, CascadedGaze block, and the Global Context Extractor module can be found in /CascadedGaze/basicsr/models/archs/CGNet_arch.py

The implementation of Multi-Head CascadedGaze Net can be found in /CascadedGaze/basicsr/models/archs/CGNetMultiHead_arch.py

Denoising on SIDD

1. Data Preparation

Download the train set(from the SIDD dataset website) and place it in `./datasets/SIDD/Data/`,

Download the evaluation data in lmdb format (from the SIDD dataset website) and place it in `./datasets/SIDD/test/`:

After downloading, it should be like this:

./datasets/
└── SIDD/
    ├── Data/
    │   ├── 0001
    │   │   ├── GT_SRGB.PNG
    │   │   ├── NOISY_SRGB.PNG
    │   │   ....
    │   └── 0200
    │       ├── GT_SRGB.PNG
    │       ├── NOISY_SRGB.PNG    
    ├── train/
    └── test/
        ├── input.imdb
        └── target.imdb

Use python scripts/data_preparation/sidd.py to crop the train image pairs to 512x512 patches and make the data into lmdb format. the processed images will be saved in ./datasets/SIDD/train/

2. Training

To train the CascadedGaze model:

python -m torch.distributed.launch --nproc_per_node=8 --master_port=8081 basicsr/train.py -opt options/train/SIDD/CascadedGaze-SIDD.yml --launcher pytorch

3. Evaluation

Note: Due to the file size limitation, we are not able to share the pre-trained models in this code submission. However, they will be provided with an open-source release of the code.

Testing the model

To evaluate the pre-trained model use this command:

python -m torch.distributed.launch --nproc_per_node=8 --master_port=8080 basicsr/test.py -opt ./options/test/SIDD/CascadedGaze-SIDD.yml --launcher pytorch

4. Model complexity and inference speed

To get the parameter count, MAC, and inference speed use this command:

python CascadedGaze/basicsr/models/archs/CGNet_arch.py

Gaussian Image denoising

1. Data Preparation

Clone the Restormer's github project and follow their instructions the download the train and test datasets.

2. Training

To train the CascadedGaze model follow these steps:

Copy the /CascadedGaze/basicsr/models/archs/CGNet_Guassian_arch.py to /Restormer/basicsr/models/archs/
Copy the training option files from /CascadedGaze/options/train/Gaussian/ to /Restormer/Denoising/Options/
Follow Restormer's training instructions and train models on different noise levels

3. Evaluation

Note: Pretrained models will be released soon.

Testing the model

To evaluate the pre-trained model, start by adjusting the noise level (sigma=15, 25 or 50), the paths to the trained model, and the training option file within the code. Once modified, execute the following command.

python CascadedGaze/basicsr/test_gaussian_color_denoising.py

4. Model complexity and inference speed

To get the parameter count, MAC, and inference speed use this command:

python CascadedGaze/basicsr/models/archs/CGNet_Guassian_arch.py

Deblurring on GoPro

1. Data Preparation

Download the train set(from the Gopro dataset website) and place it in `./datasets/GoPro/train`,

Download the evaluation data in lmdb format (from the Gopro dataset website) and place it in `./datasets/GoPro/test/`:

After downloading, it should be like this:

./datasets/
└── GoPro/
    ├── train/
    │   ├── input/
    │   └── target/
    └── test/
        ├── input.imdb
        └── target.imdb

python scripts/data_preparation/gopro.py to crop the train image pairs to 512x512 patches and make the data into lmdb format.

2. Training

To train the CascadedGaze Multihead model:

python -m torch.distributed.launch --nproc_per_node=8 --master_port=8081 basicsr/train.py -opt options/train/GoPro/CascadedGazeMH-GoPro.yml --launcher pytorch

To finetune the trained CascadedGazeMH model on larger patches

python -m torch.distributed.launch --nproc_per_node=8 --master_port=8081 basicsr/train.py -opt options/train/GoPro/CascadedGazeMH-GoPro-fintune_largerPatch.yml --launcher pytorch

3. Evaluation

Note: Pretrained models will be released soon.

Testing the model

To evaluate the pre-trained model use this command:

python -m torch.distributed.launch --nproc_per_node=8 --master_port=8080 basicsr/test.py -opt ./options/test/GoPro/CascadedGazeMH-GoPro.yml --launcher pytorch

4. Model complexity and inference speed

To get the parameter count, MAC, and inference speed use this command:

python CascadedGaze/basicsr/models/archs/CGNetMultiHead_arch.py

Visualizing the training logs

You can use Tensorboard to track the training status:

tensorboard --logdir=/CascadedGaze/logs

Citation

If you use CascadedGaze, or this codebase in your work, please consider citing this work:

@article{
ghasemabadi2024cascadedgaze,
title={CascadedGaze: Efficiency in Global Context Extraction for Image Restoration},
author={Amirhosein Ghasemabadi and Muhammad Kamran Janjua and Mohammad Salameh and CHUNHUA ZHOU and Fengyu Sun and Di Niu},
journal={Transactions on Machine Learning Research},
issn={2835-8856},
year={2024},
url={https://openreview.net/forum?id=C3FXHxMVuq},
note={}
}

Files

README.md

Latest commit

History

README.md

File metadata and controls

CascadedGaze: Efficiency in Global Context Extraction for Image Restoration

📰 News

Installation

Quick Start

CascadedGaze implementation

Denoising on SIDD

1. Data Preparation

Download the train set(from the SIDD dataset website) and place it in ./datasets/SIDD/Data/,

Download the evaluation data in lmdb format (from the SIDD dataset website) and place it in ./datasets/SIDD/test/:

After downloading, it should be like this:

2. Training

3. Evaluation

Note: Due to the file size limitation, we are not able to share the pre-trained models in this code submission. However, they will be provided with an open-source release of the code.

Testing the model

4. Model complexity and inference speed

Gaussian Image denoising

1. Data Preparation

Clone the Restormer's github project and follow their instructions the download the train and test datasets.

2. Training

To train the CascadedGaze model follow these steps:

3. Evaluation

Note: Pretrained models will be released soon.

Testing the model

4. Model complexity and inference speed

Deblurring on GoPro

1. Data Preparation

Download the train set(from the Gopro dataset website) and place it in ./datasets/GoPro/train,

Download the evaluation data in lmdb format (from the Gopro dataset website) and place it in ./datasets/GoPro/test/:

After downloading, it should be like this:

2. Training

3. Evaluation

Note: Pretrained models will be released soon.

Testing the model

4. Model complexity and inference speed

Visualizing the training logs

Citation

Download the train set(from the SIDD dataset website) and place it in `./datasets/SIDD/Data/`,

Download the evaluation data in lmdb format (from the SIDD dataset website) and place it in `./datasets/SIDD/test/`:

Download the train set(from the Gopro dataset website) and place it in `./datasets/GoPro/train`,

Download the evaluation data in lmdb format (from the Gopro dataset website) and place it in `./datasets/GoPro/test/`: