some question about pixel generator #1

miganchuanbo · 2023-12-07T09:53:54Z

Thanks for the excellent work. I am a bit confusing about the Fig 3(b). In the fig, the original image and the representation are sent to the pixel generator. I am just wondering if it is ok to exclude the original image (just the representation).

LTH14 · 2023-12-07T11:44:14Z

Thanks for your interest. Please note that Fig 3(b) is to illustrate the pixel generator's training phase. Most current generative frameworks, such as MAGE and LDM, either partially mask or add noise to the original image, and ask the model to reconstruct the original image during training. In FIg 3(b), we take MAGE as an example, which first tokenizes the image into image tokens and then masks some of the tokens. Therefore, the original image is needed as the input of the training phase. However, we do not need the original image during generation -- generation starts from a 100% masked image (MAGE), or Gaussian noise (LDM/ADM), conditioned on only the representation.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

some question about pixel generator #1

some question about pixel generator #1

miganchuanbo commented Dec 7, 2023

LTH14 commented Dec 7, 2023 •

edited

Loading

some question about pixel generator #1

some question about pixel generator #1

Comments

miganchuanbo commented Dec 7, 2023

LTH14 commented Dec 7, 2023 • edited Loading

LTH14 commented Dec 7, 2023 •

edited

Loading