[WIP] Allow DDIMInverseScheduler to use same number of noising and denoising steps #3436

kathath · 2023-05-15T13:28:05Z

In the final DDIMScheduler step, we are already at t=0 and predict the sample at t=0-num_train_steps/num_inference_steps.
The current DDIMInverseScheduler implementation starts with the original image at t=0. To match DDIMScheduler, however, inversion needs to start at t=0-num_train_steps/num_inference_steps so that all denoising steps are reverted.
The mismatch also shows up in StableDiffusionPix2PixZeroPipeline and StableDiffusionDiffEditPipeline (the two pipelines that currently use DDIMInverseScheduler): they run num_inference_steps denoising steps but only num_inference_steps-1 noising steps.
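To make the step count concrete, here is a minimal sketch in plain Python. It assumes "leading" timestep spacing with a zero offset, and the helper names are hypothetical; it only enumerates timestep transitions and is not the diffusers implementation.

```python
def denoising_transitions(num_train_timesteps, num_inference_steps):
    """(t, t_prev) pairs of a DDIM-style sampler (assumed 'leading' spacing, offset 0)."""
    step_ratio = num_train_timesteps // num_inference_steps
    # Timesteps visited by the sampler, from most noisy down to t=0.
    timesteps = [i * step_ratio for i in range(num_inference_steps)][::-1]
    # Every step moves from t to t - step_ratio, so the last one ends below 0.
    return [(t, t - step_ratio) for t in timesteps]


def full_inversion_transitions(num_train_timesteps, num_inference_steps):
    """Undo every denoising transition: start below t=0 and move upwards."""
    forward = denoising_transitions(num_train_timesteps, num_inference_steps)
    return [(t_prev, t) for (t, t_prev) in reversed(forward)]


def inversion_from_zero_transitions(num_train_timesteps, num_inference_steps):
    """Start at t=0 instead: the final denoising step (0 -> -step_ratio) is never undone."""
    full = full_inversion_transitions(num_train_timesteps, num_inference_steps)
    return [(start, end) for (start, end) in full if start >= 0]


if __name__ == "__main__":
    n_train, n_infer = 1000, 10
    print("denoising steps:", len(denoising_transitions(n_train, n_infer)))
    print("noising steps (full inversion):", len(full_inversion_transitions(n_train, n_infer)))
    print("noising steps (start at t=0):", len(inversion_from_zero_transitions(n_train, n_infer)))
```

Under these assumptions, starting the inversion at t=0 yields one transition fewer than the denoising loop, while starting one step ratio below t=0 yields the same number.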

In contrast to the current DDIMInverseScheduler implementation, the inversion in Null-text Inversion for Editing Real Images using Guided Diffusion Models starts at t=0-num_train_steps/num_inference_steps and reverses all denoising steps.
The DDIMInverseScheduler description states that "The implementation is mostly based on the DDIM inversion definition of Null-text Inversion for Editing Real Images using Guided Diffusion Models", which is currently not quite the case.

This PR makes it possible to use DDIM inversion as implemented in Null-text Inversion for Editing Real Images using Guided Diffusion Models by setting revert_all_steps=True.

This PR is strongly influenced by https://github.com/google/prompt-to-prompt/#null-text-inversion-for-editing-real-images.

HuggingFaceDocBuilderDev · 2023-05-15T13:33:55Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

patrickvonplaten · 2023-05-17T09:03:28Z

Hmm @clarencechen wdyt here? It essentially reverts the PR here: #2619

clarencechen · 2023-05-29T01:52:36Z

@patrickvonplaten I wouldn't say it reverts #2619, since the DDIMInverseScheduler implementation before then was also dropping the last (most noisy) timestep. It does, however, revert the semantic interpretation of the timestep argument away from "timestep corresponding to current noise level of input", which I personally strongly prefer. I think I would wait until a fine-tuned model addressing #3475 is out, since the current timestep I'm skipping corresponds to the final steps_offset training timesteps of the DDIMScheduler denoising process.

bghira · 2023-05-30T20:45:04Z

i have uploaded ptx0/pseudo-journey-v2 which is trained using a patched noise schedule. it has 30,000 steps applied over most of the text encoder for SD 2.1-v over about 30,000 images.

patrickvonplaten

Left some comments inline

patrickvonplaten · 2023-06-02T15:04:15Z

src/diffusers/schedulers/scheduling_ddim_inverse.py

-            each diffusion step uses the value of alphas product at that step and at the previous one. For the final
-            step there is no previous alpha. When this option is `True` the previous alpha product is fixed to `0`,
-            otherwise it uses the value of alpha at step `num_train_timesteps - 1`.
+        set_alpha_to_one (`bool`, default `True`):


Why do we have to do the renaming here? This is backwards breaking IMO

patrickvonplaten · 2023-06-02T15:04:44Z

src/diffusers/schedulers/scheduling_ddim_inverse.py

        steps_offset: int = 0,
        prediction_type: str = "epsilon",
        clip_sample_range: float = 1.0,
+        revert_all_steps: bool = False,


is this ever used?

github-actions · 2023-06-27T15:02:44Z

This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.

Please note that issues that do not follow the contributing guidelines are likely to be ignored.

kathath added 2 commits May 15, 2023 13:25

adapt DDIMInverseScheduler to match analogous forward process

94aa5b3

fix code formatting; do not modify timesteps tensor

fb9eb20

fix quality check

bea2f62

kathath added 2 commits May 29, 2023 17:40

start timesteps of inverse scheduler at last timestep of corresponding denoising process

512d9cc

Merge branch 'main' into improve-ddim-inverse-scheduler

cf0b649

kathath changed the title ~~Allow DDIMInverseScheduler to use same number of noising and denoising steps~~ [WIP] Allow DDIMInverseScheduler to use same number of noising and denoising steps May 30, 2023

Merge branch 'main' into improve-ddim-inverse-scheduler

d86fb91

patrickvonplaten reviewed Jun 2, 2023

clarencechen mentioned this pull request Jun 24, 2023

Add Recent Timestep Scheduling Improvements to DDIM Inverse Scheduler #3865

Merged

6 tasks

github-actions bot added the stale Issues that haven't received updates label Jun 27, 2023

github-actions bot closed this Jul 5, 2023
