DiffuSphere - An Object-Oriented Framework for Image Generation using Diffusion Models and their Variants.

DiffuSphere is a cutting-edge framework designed to streamline and enhance the process of image generation using diffusion models. Built with industry-standard coding methodologies, DiffuSphere ensures high scalability, maintainability, and efficient bug tracking, making it suitable for research and production environments.

The framework adopts a modular and object-oriented architecture, enabling developers to extend or customize its components effortlessly while maintaining code clarity and robustness.

With DiffuSphere, zero coding is required to generate images using the latest and most powerful diffusion model variants. The framework is designed to work out-of-the-box, allowing users to leverage advanced diffusion models without writing a single line of code.

Furthermore, integrating new models into the framework is incredibly straightforward, as 95% of the required code infrastructure is already in place. This allows researchers and developers to focus on innovation rather than boilerplate code.

Supported Models

The framework supports the training and evaluation of various diffusion models, including:

DDPM (Denoising Diffusion Probabilistic Models):
- A generative model that learns to iteratively reverse a noise process to generate high-quality samples.
DDPM with EMA (Exponential Moving Average):
- An enhanced DDPM model with weights updated with an EMA for better stability and improved sample quality.
DDPM CFG (Classifier-Free Guidance):
- A variant of DDPM that uses a guidance mechanism without a classifier to better control the generation process by amplifying desired features.
DDPM CFG EMA:
- Combines the benefits of CFG with EMA-based weight updates for even more stable training and higher-quality controlled generation.
CFG++ (Classifier-Free Guidance++):
- An advanced version of Classifier-Free Guidance that improves control and diversity in sample generation by refining the guidance mechanism.
CFG++ EMA:
- A CFG++ model incorporating EMA to further enhance stability and generation quality, particularly in fine-grained and high-detail samples.
DDPM Power Law EMA:
- A DDPM model that incorporates a power law decay schedule for the EMA updates, allowing for more precise adjustment of weights and improved long-term training stability.
DDPM CFG Power Law EMA:
- Combines the Classifier-Free Guidance mechanism with a power law decay EMA schedule, enabling better control of the generation process and producing high-quality outputs with enhanced stability.
DDPM CFG++ Power Law EMA:
- The most advanced model in the DiffuSphere suite, integrating CFG++, EMA, and a power law decay schedule. This model excels in controlled generation tasks, offering unparalleled stability and sample diversity.

Each of these models provides flexibility for various use cases, balancing control, stability, and sample quality according to the task's requirements. The modular design of DiffuSphere ensures seamless transitions between models and effortless integration of new variations.

Project Structure and Overview

---- Main Scripts ---->

call_methods.py: Handles the creation of datasets, networks, and models dynamically based on user specifications.
train.py: Main training script for running and managing the model training loops.

---- `data` Directory ---->

datasets.py: Defines the base class for datasets, including data loading and preprocessing functionalities.
mnist.py: Contains dataset classes for handling MNIST training and testing datasets.
topographies.py: Implements the BiologicalObservation dataset class for working with biological images and topographical data.

---- `model` Directory ---->

attention_block.py: Implements attention mechanisms for improving the UNet model's performance.
ddpm.py: Contains the implementation of Denoising Diffusion Probabilistic Models (DDPM) instance and its variants.
downsampling_block.py: Implements downsampling operations in the UNet model.
models.py: Defines the base class for all models, including training and saving mechanisms.
networks.py: Base class for networks, defining essential methods like forward pass and parameter counting.
nin_block.py: Implements Network-in-Network (NiN) blocks for feature extraction.
resnet_block.py: Defines residual blocks to facilitate deep feature extraction in the UNet.
timestep_embedding.py: Provides a time embedding mechanism for incorporating temporal information.
unet.py: Implements the UNet architecture tailored for diffusion models.
upsampling_block.py: Handles upsampling operations in the UNet.

---- `option` Directory ---->

base_options.py: Defines the base configuration options used across all experiments. It uses Python's argparse library to handle command-line arguments.
train_options.py: Extends base options by adding specific configuration settings for training machine learning models through the pipeline.
enums.py: Centralized location for managing constant values like dataset names, model types, and training modes.
config.py: defines configuration classes by utilising data classes module from Python for a structured and type-safe way to manage default parameters.

---- `utils` Directory ---->

images_utils.py: Provides utility functions for image transformations, including resizing and normalization.
utils.py: General utilities such as setting seeds for reproducibility and directory management.

How to Use

Follow these steps to set up and run DiffuSphere for training and image generation:

1. Install Requirements

Ensure you have all the dependencies installed. Run the following command:

pip install -r requirements.txt

2. Clone the Repository

Clone the DiffuSphere repository to your local machine:

git clone https://github.com/yourusername/DiffuSphere.git
cd DiffuSphere

3. Create a Repository for Loading Data

Prepare a directory structure for your data:

Place your training data (e.g., images, labels) in a new directory, preferably located outside the main DiffuSphere repository (to handle large datasets effectively). For example:

mkdir -p /path/to/large_dataset_repo

Configure the dataset path in the base_options.py file:

Example: Modify the dataset path in base_options.py

--image_folder = "/path/to/large_dataset_repo/images"
--label_path = "/path/to/large_dataset_repo/labels.csv"

Also DiffuSphere is highly configurable via flags and script modifications:

Flags in base_options.py and train_options.py: Dataset parameters like --dataset_name, image size, batch size, and more. Training parameters such as learning rate, number of epochs, optimizer type.

4. Start Training

Run the training process:

python train.py

Optional: Advanced Flag Management with train.sh

If you have multiple flags to modify or want a streamlined way to manage configurations:

Open launch and edit the train.sh file:

Example:

python train.py \
--images_folder '../../Dataset/Topographies/raw/FiguresStacked 8X8_4X4_2X2 Embossed' \
--label_path '../../Dataset/biology_data/TopoChip/MacrophageWithClass.csv' \
--dataset_name 'biological' \
--n_epochs 40000 \
--img_size 64 \
--batch_size 32 \
--num_workers 4 \

# Add the parameters and values accordingly

Run the script:

./train.sh

5. Predict

Readme for predict will be updated soon. Apologies!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiffuSphere - An Object-Oriented Framework for Image Generation using Diffusion Models and their Variants.

Supported Models

Project Structure and Overview

---- Main Scripts ---->

---- `data` Directory ---->

---- `model` Directory ---->

---- `option` Directory ---->

---- `utils` Directory ---->

How to Use

1. Install Requirements

2. Clone the Repository

3. Create a Repository for Loading Data

4. Start Training

Optional: Advanced Flag Management with train.sh

5. Predict

About

Releases

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
data		data
launch		launch
model		model
option		option
utils		utils
.gitignore		.gitignore
README.md		README.md
call_methods.py		call_methods.py
train.py		train.py

Karthi-DStech/DiffuSphere-Object-Oriented-Framework-

Folders and files

Latest commit

History

Repository files navigation

DiffuSphere - An Object-Oriented Framework for Image Generation using Diffusion Models and their Variants.

Supported Models

Project Structure and Overview

---- Main Scripts ---->

---- data Directory ---->

---- model Directory ---->

---- option Directory ---->

---- utils Directory ---->

How to Use

1. Install Requirements

2. Clone the Repository

3. Create a Repository for Loading Data

4. Start Training

Optional: Advanced Flag Management with train.sh

5. Predict

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

---- `data` Directory ---->

---- `model` Directory ---->

---- `option` Directory ---->

---- `utils` Directory ---->

Packages