Add multistep DPM-Solver discrete scheduler #1132

LuChengTHU · 2022-11-04T12:34:14Z

Add the multistep version of DPM-Solver, which accepts discrete timesteps in the same way as DDPM, DDIM, and PNDM.

The solver runs successfully with the stable-diffusion pipeline in both the PyTorch and JAX versions.

Currently, it supports the following algorithms:

multistep DPM-Solver (for discretizing the integral w.r.t. noise prediction model) with order = 1, 2, 3
multistep DPM-Solver++ (for discretizing the integral w.r.t. data prediction model) with order = 1, 2, 3
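
The two parameterizations listed above are tied together by the forward-process identity x_t = alpha_t * x_0 + sigma_t * eps. The following is a minimal sketch of that conversion, assuming scalar values; the function names are hypothetical and are not part of this PR.

```python
def eps_to_x0(x_t, eps, alpha_t, sigma_t):
    """Turn a noise prediction into a data prediction (hypothetical helper)."""
    return (x_t - sigma_t * eps) / alpha_t


def x0_to_eps(x_t, x0, alpha_t, sigma_t):
    """Turn a data prediction into a noise prediction (hypothetical helper)."""
    return (x_t - alpha_t * x0) / sigma_t


# Illustrative values only; alpha_t**2 + sigma_t**2 == 1 as in a VP schedule.
alpha_t, sigma_t = 0.8, 0.6
x0_true, eps_true = 1.5, -0.25
x_t = alpha_t * x0_true + sigma_t * eps_true

x0_recovered = eps_to_x0(x_t, eps_true, alpha_t, sigma_t)
eps_recovered = x0_to_eps(x_t, x0_true, alpha_t, sigma_t)
```

DPM-Solver discretizes the integral over the noise prediction, while DPM-Solver++ discretizes it over the data prediction obtained as above.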

For more details of the algorithms, please refer to https://github.com/LuChengTHU/dpm-solver

#1101

patil-suraj

Great work @LuChengTHU, super cool PR!

The PR already looks super good, I just left some nits.

Also tried it out with stable diffusion and it works really well, and generates really good output with 20-25 steps, for some examples even 10-15 🤯 !

QQ: What are the recommended settings for Stable Diffusion? I used the following and it worked super well. We will need to update these in the config of the SD checkpoints.

dpm = DPMSolverDiscreteScheduler.from_config("CompVis/stable-diffusion-v1-4", subfolder="scheduler", solver_order=2, predict_x0=False, denoise_final=True)

And is this scheduler different from the DPM version which requires two model evaluations per step?

We are still thinking about an API for schedulers that require two model evaluations. Will have a draft PR next week. Then we can add the other version of DPM once the API is finalised.

Also, to differentiate it from the other scheduler could we call this DPMMultiStepScheduler ?

src/diffusers/__init__.py

src/diffusers/schedulers/scheduling_dpmsolver_discrete.py

src/diffusers/schedulers/scheduling_dpmsolver_discrete_flax.py

tests/test_config.py

tests/test_scheduler.py

LuChengTHU · 2022-11-04T17:37:18Z

Thank you very much for the quick and high-quality review!

Q1: What are the recommended settings for stable diffusion?

I use the following settings, which behave similarly to yours:

dpm = DPMSolverMultistepScheduler.from_config(
    "CompVis/stable-diffusion-v1-4",  # or use the v1-5 version
    subfolder="scheduler",
    solver_order=2,
    predict_epsilon=True,
    thresholding=False,
    algorithm_type="dpmsolver++",
    solver_type="midpoint",
    denoise_final=True,  # this trick mainly matters when the number of steps is small (e.g. <= 10)
)
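
To make the `denoise_final` comment concrete, here is a rough sketch of how a multistep solver could pick the order of each update. It assumes that the trick falls back to lower-order updates on the last step(s) when the total number of steps is small; the helper name `choose_order` and the threshold of 15 (taken from the commit message "for steps < 15") are assumptions, and the actual scheduler code may differ.

```python
def choose_order(step_index, num_steps, solver_order=2,
                 denoise_final=True, small_step_threshold=15):
    """Hypothetical helper: order of the update used at `step_index`."""
    # Warm-up: a k-th order multistep update needs k stored model outputs,
    # so the first step is always first order.
    order = min(solver_order, step_index + 1)
    # Assumed stabilizing trick: with few steps, lower the order at the end
    # so that the very last update is first order.
    if denoise_final and num_steps < small_step_threshold:
        steps_left = num_steps - step_index  # counts the current step
        order = min(order, steps_left)
    return order


orders_10 = [choose_order(i, 10) for i in range(10)]
orders_20 = [choose_order(i, 20) for i in range(20)]
print(orders_10)  # [1, 2, 2, 2, 2, 2, 2, 2, 2, 1]
```

With 20 steps the threshold is not met, so only the warm-up step is first order.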

Q2: Is this scheduler different from the DPM version, which requires two model evaluations per step?

Yes, they are different. The DPM version you mentioned is the singlestep DPM-Solver in my repo, which was proposed in my paper "DPM-Solver" [1]. The version in this PR is the multistep DPM-Solver in my repo, which was proposed in my other paper, "DPM-Solver++" [2].

In the paper "DPM-Solver++", I carefully compare the different settings of the solvers for guided sampling (i.e. conditional sampling) by diffusion models, and I find that:

All previous high-order solvers are unstable and fail to generate samples for large guidance scales and small steps (<20).
We further proposed algorithms that discretize the integral w.r.t. the data prediction model, and can greatly stabilize the sample quality for small steps (<20). We use it as algorithm_type="dpmsolver++".
We find that for large guidance scales, higher-order solvers are extremely unstable. So we only use the 2nd-order solver for guided sampling.
We find that the multistep version is slightly better than the singlestep version.

Therefore, I suggest using the 2nd-order multistep DPM-Solver++, which corresponds to the settings above.

However, the 3rd-order method is still useful for unconditional sampling, where it can achieve better sample quality than the 2nd-order method. So I also provide a 3rd-order method in this PR.
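
As a rough illustration of what the 2nd-order multistep DPM-Solver++ update looks like, the sketch below applies it to a one-dimensional toy problem. Everything here is an assumption made for illustration: the data distribution is Gaussian (so the ideal data prediction is known in closed form), the steps are uniform in lambda = log(alpha / sigma), and the names `alpha_sigma`, `predict_x0`, and `sample` are hypothetical. It is not the scheduler code from this PR.

```python
import math

DATA_STD = 2.0  # toy data distribution: x0 ~ N(0, DATA_STD**2)


def alpha_sigma(lam):
    """VP schedule written in terms of lambda = log(alpha / sigma)."""
    alpha = math.sqrt(1.0 / (1.0 + math.exp(-2.0 * lam)))
    sigma = math.sqrt(1.0 / (1.0 + math.exp(2.0 * lam)))
    return alpha, sigma


def predict_x0(x, lam):
    """Ideal data prediction E[x0 | x_t] for the Gaussian toy data."""
    alpha, sigma = alpha_sigma(lam)
    return alpha * DATA_STD**2 / (alpha**2 * DATA_STD**2 + sigma**2) * x


def sample(x, lambdas, order):
    """DPM-Solver++ with order 1, or order 2 multistep (midpoint form)."""
    prev_x0 = None
    for i in range(len(lambdas) - 1):
        lam_s, lam_t = lambdas[i], lambdas[i + 1]
        _, sigma_s = alpha_sigma(lam_s)
        alpha_t, sigma_t = alpha_sigma(lam_t)
        h = lam_t - lam_s
        x0 = predict_x0(x, lam_s)  # the only model evaluation in this step
        d = x0
        if order == 2 and prev_x0 is not None:
            # Reuse the previous model output instead of a second evaluation.
            r = (lam_s - lambdas[i - 1]) / h
            d = x0 + 0.5 * (x0 - prev_x0) / r
        x = (sigma_t / sigma_s) * x - alpha_t * math.expm1(-h) * d
        prev_x0 = x0
    return x


def marginal_std(lam):
    alpha, sigma = alpha_sigma(lam)
    return math.sqrt(alpha**2 * DATA_STD**2 + sigma**2)


num_steps = 20
lambdas = [-3.0 + 6.0 * i / num_steps for i in range(num_steps + 1)]
x_start = 1.0
# For Gaussian data the probability-flow ODE keeps x / marginal_std constant.
exact = x_start * marginal_std(lambdas[-1]) / marginal_std(lambdas[0])
err_1 = abs(sample(x_start, lambdas, order=1) - exact)
err_2m = abs(sample(x_start, lambdas, order=2) - exact)
print(f"order 1 error: {err_1:.4f}, order 2 multistep error: {err_2m:.4f}")
```

Both variants call the model once per step; the multistep variant gains its extra order by reusing the data prediction stored from the previous step.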

References:
[1] DPM-Solver: https://arxiv.org/abs/2206.00927
[2] DPM-Solver++: https://arxiv.org/abs/2211.01095

Q3: We are still thinking about an API for schedulers that require two model evaluations. Will have a draft PR next week. Then we can add the other version of DPM once the API is finalised.

Great to know! I can help add the corresponding singlestep DPM-Solver at that time.

Q4: To differentiate it from the other scheduler could we call this DPMMultiStepScheduler ?

I've changed the name to "DPMSolverMultistepScheduler".

Q5: other modifications.

Thanks for your careful review! I will fix them in later commits.

patil-suraj · 2022-11-06T12:11:37Z

Thanks a lot for the detailed answer and addressing the review comments @LuChengTHU ! Good to merge for me :)

@patrickvonplaten or @anton-l would be nice if you could also take a quick look :)

src/diffusers/utils/dummy_torch_and_accelerate_objects.py

patrickvonplaten

Amazing addition @LuChengTHU!

The API choices are great and it's amazing that it fits so well with the existing API of Stable Diffusion :-)

Only left a couple of nits:

suggestion to rename predict_x0 to predict_epsilon for consistency
Add the DPM-Solver scheduler as a compatible scheduler to all other schedulers

src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

tests/test_scheduler.py

src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

src/diffusers/schedulers/scheduling_dpmsolver_multistep_flax.py

src/diffusers/pipelines/stable_diffusion/pipeline_stable_diffusion.py

src/diffusers/schedulers/scheduling_dpmsolver_multistep.py

LuChengTHU · 2022-11-06T15:20:08Z

Hi @patrickvonplaten, thank you for the quick review! I've made all the requested changes. The main difference is that I've made the APIs for predict_epsilon, algorithm_type, and solver_type clearer (and updated the docs accordingly).

Looking forward to your reply!

patil-suraj · 2022-11-06T21:49:49Z

Thank you for addressing the comments, let's go!

* add dpmsolver discrete pytorch scheduler
* fix some typos in dpm-solver pytorch
* add dpm-solver pytorch in stable-diffusion pipeline
* add jax/flax version dpm-solver
* change code style
* change code style
* add docs
* add `add_noise` method for dpmsolver
* add pytorch unit test for dpmsolver
* add dummy object for pytorch dpmsolver
* Update src/diffusers/schedulers/scheduling_dpmsolver_discrete.py
  Co-authored-by: Suraj Patil <[email protected]>
* Update tests/test_config.py
  Co-authored-by: Suraj Patil <[email protected]>
* Update tests/test_config.py
  Co-authored-by: Suraj Patil <[email protected]>
* resolve the code comments
* rename the file
* change class name
* fix code style
* add auto docs for dpmsolver multistep
* add more explanations for the stabilizing trick (for steps < 15)
* delete the dummy file
* change the API name of predict_epsilon, algorithm_type and solver_type
* add compatible lists

Co-authored-by: Suraj Patil <[email protected]>

This was referenced Nov 4, 2022

Is there a diffuser version? LuChengTHU/dpm-solver#9

Closed

[Feature Request]: Add newest DPM-Solver++ AUTOMATIC1111/stable-diffusion-webui#4280

Closed

Add the newest DPM-Solver crowsonkb/k-diffusion#40

Closed

LuChengTHU force-pushed the multistep-dpm-solver-scheduler branch from d8907c7 to bb055d9 Compare November 4, 2022 14:44

patil-suraj self-assigned this Nov 4, 2022

patil-suraj approved these changes Nov 4, 2022

patil-suraj requested review from anton-l, patrickvonplaten and pcuenca November 4, 2022 16:31

keturn mentioned this pull request Nov 5, 2022

add k_dpmpp_2_a and k_dpmpp_2 solvers options invoke-ai/InvokeAI#1389

Merged

This was referenced Nov 6, 2022

Stabilize the sampling of DPM-Solver++2M by a stabilizing trick crowsonkb/k-diffusion#43

Open

[Feature Request]: Stabilize the sampling of DPM-Solver++ by a stabilizing trick AUTOMATIC1111/stable-diffusion-webui#4377

Closed

patil-suraj reviewed Nov 6, 2022

src/diffusers/utils/dummy_torch_and_accelerate_objects.py (outdated, resolved)

LuChengTHU and others added 14 commits November 6, 2022 20:38

add dpmsolver discrete pytorch scheduler

e648d3d

fix some typos in dpm-solver pytorch

2fe70df

add dpm-solver pytorch in stable-diffusion pipeline

dc2d548

add jax/flax version dpm-solver

b71ec00

change code style

92ae949

change code style

c06d554

add docs

dc3d496

add add_noise method for dpmsolver

9fb0acb

add pytorch unit test for dpmsolver

d657d43

add dummy object for pytorch dpmsolver

00e8632

Update src/diffusers/schedulers/scheduling_dpmsolver_discrete.py

6a1f834

Co-authored-by: Suraj Patil <[email protected]>

Update tests/test_config.py

4843fc3

Co-authored-by: Suraj Patil <[email protected]>

Update tests/test_config.py

31ed110

Co-authored-by: Suraj Patil <[email protected]>

resolve the code comments

864d0bb

LuChengTHU added 6 commits November 6, 2022 20:38

rename the file

e9f0fbc

change class name

bc2afd5

fix code style

7c7c2ec

add auto docs for dpmsolver multistep

a6efda1

add more explanations for the stabilizing trick (for steps < 15)

5566a2b

delete the dummy file

3ac6ab4

LuChengTHU force-pushed the multistep-dpm-solver-scheduler branch from a497362 to 3ac6ab4 Compare November 6, 2022 12:40

patrickvonplaten approved these changes Nov 6, 2022

LuChengTHU added 2 commits November 6, 2022 23:05

change the API name of predict_epsilon, algorithm_type and solver_type

f54cf99

add compatible lists

dee238f

patil-suraj merged commit b4a1ed8 into huggingface:main Nov 6, 2022

LuChengTHU deleted the multistep-dpm-solver-scheduler branch November 7, 2022 07:03

LuChengTHU mentioned this pull request Nov 7, 2022

Efficient sampling via DPM-Solver lucidrains/imagen-pytorch#171

Open

averad mentioned this pull request Nov 11, 2022

Cannot import name 'DPMSolverMultistepScheduler' from 'diffusers' #1260

Closed

ClashSAN mentioned this pull request Mar 10, 2023

which scheduler is "DPM++ 2s a Karras" and "DPM++ 2M Karras" ? #2635

Closed
