Hello,
Firstly, thanks for the great research and sharing your implementation!
I have a question regarding data normalization when using the sd-vae-ft-mse latent encoder. In your source code, encode_pixels is called before applying the VAE, and it scales the images to [0, 1]. However, as far as I can tell, the SD VAE was trained with inputs normalized to [-1, 1] (huggingface/diffusers#3726 (comment)).
Could you please clarify this discrepancy?
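To make the question concrete, here is a minimal sketch of the two conventions I am comparing. The function names are hypothetical and written only for illustration; none of this is taken from your repository or from diffusers, and it assumes uint8 input pixels.

```python
import numpy as np

def to_unit_range(x_uint8):
    """Map uint8 pixels in [0, 255] to floats in [0, 1] (hypothetical helper)."""
    return x_uint8.astype(np.float32) / 255.0

def to_symmetric_range(x_uint8):
    """Map uint8 pixels in [0, 255] to floats in [-1, 1] (hypothetical helper)."""
    return x_uint8.astype(np.float32) / 127.5 - 1.0

def unit_to_symmetric(x01):
    """Convert values already in [0, 1] to [-1, 1]."""
    return 2.0 * x01 - 1.0

pixels = np.array([0, 128, 255], dtype=np.uint8)
unit = to_unit_range(pixels)            # values lie in [0, 1]
symmetric = to_symmetric_range(pixels)  # values lie in [-1, 1]
```

If the VAE does expect [-1, 1], then I would have expected something like unit_to_symmetric to be applied between the [0, 1] scaling and the encoder call, unless that rescaling already happens somewhere I have missed.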
Thanks,
Timotei