
Question about data normalization #9

Closed
TArdelean opened this issue Nov 11, 2024 · 2 comments

@TArdelean

Hello,
Firstly, thanks for the great research and sharing your implementation!
I have a question regarding data normalization when using the sd-vae-ft-mse latent encoder. In your source code, encode_pixels scales the images to [0, 1] before applying the VAE. However, as far as I can tell, the SD VAE was trained with a [-1, 1] normalization (huggingface/diffusers#3726 (comment)).
Could you please clarify this discrepancy?

Thanks,
Timotei
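For readers following along, here is a minimal sketch of the two pixel-normalization conventions under discussion. The function names are my own, not from the repository; the point is only that the two ranges differ by a fixed affine map, so any latent statistics computed under one convention will not match the other.

```python
import numpy as np

def to_unit_range(pixels_u8):
    # [0, 255] uint8 -> [0, 1] float: 0 -> 0.0, 255 -> 1.0
    return pixels_u8.astype(np.float32) / 255.0

def to_signed_range(pixels_u8):
    # [0, 255] uint8 -> [-1, 1] float (the convention the SD VAE
    # was reportedly trained with): 0 -> -1.0, 255 -> 1.0
    return pixels_u8.astype(np.float32) / 127.5 - 1.0

img = np.array([[0, 128, 255]], dtype=np.uint8)
unit = to_unit_range(img)
signed = to_signed_range(img)

# The two conventions are related by a fixed affine transform:
# signed = 2 * unit - 1
assert np.allclose(signed, 2.0 * unit - 1.0)
```

Because the difference is a global scale and shift, feeding [0, 1] inputs into a VAE trained on [-1, 1] simply produces latents with different statistics, which is why the choice of normalization matters for any downstream latent scaling.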

@SonicCodes

```python
def encode_pixels(self, x): # raw pixels => raw latents
```

Here is where the encoding happens, to be exact. The OpenAI consistency paper apparently discovered this:

[image: excerpt from the referenced paper]

@tkarras @TArdelean

Thanks,

@TArdelean (Author)

I see, that clears it up.
Thanks @SonicCodes
