Random noise z augmentation with cGAN #152
The current model does not take z as input.
@junyanz Could you please explain in detail how you found that z gets ignored? What did the results show?
We tried a few ways of adding z to the nets, e.g., adding z to a latent state, concatenating z with a latent state, applying dropout, etc. The output tended not to vary much as a function of z. You can see the effect of random dropout here: https://affinelayer.com/pixsrv/ Click the "pix2pix" button multiple times to see different random samples. In this implementation, the only noise is dropout (as in the pix2pix paper). Some minor details vary from click to click, but overall not much changes. Conditional GANs don't really need noise as long as the input you are conditioning on is sufficiently complex, so that it can kind of play the role of noise. Without noise, the mapping is deterministic, but that's often fine. Here's a follow-up paper that shows one way of getting z to actually have a substantial effect: https://junyanz.github.io/BicycleGAN/
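The injection strategies mentioned above (e.g., tiling z and concatenating it with the input or a latent state) can be sketched roughly as follows. This is a hypothetical toy generator for illustration only, not the actual pix2pix netG; all layer sizes and names here are made up, and as noted above, nothing in the losses forces the network to actually use z.

```python
import torch
import torch.nn as nn

# Hypothetical sketch (NOT the pix2pix implementation): one common way to
# inject a noise vector z is to tile it spatially and concatenate it to the
# conditioning image as extra channels before the generator.
class NoisyGenerator(nn.Module):
    def __init__(self, in_channels=3, z_dim=8, out_channels=3):
        super().__init__()
        self.z_dim = z_dim
        # toy stand-in for the generator; the real netG is a much deeper U-Net
        self.net = nn.Sequential(
            nn.Conv2d(in_channels + z_dim, 64, 3, padding=1),
            nn.ReLU(),
            nn.Conv2d(64, out_channels, 3, padding=1),
        )

    def forward(self, x, z):
        # tile z over the spatial dims: (B, z_dim) -> (B, z_dim, H, W)
        b, _, h, w = x.shape
        z_map = z.view(b, self.z_dim, 1, 1).expand(b, self.z_dim, h, w)
        return self.net(torch.cat([x, z_map], dim=1))

G = NoisyGenerator()
x = torch.randn(1, 3, 32, 32)
y1 = G(x, torch.randn(1, 8))
y2 = G(x, torch.randn(1, 8))
# nothing forces y1 and y2 to differ meaningfully; in practice the generator
# often learns to ignore z, which is the behavior described above
```

The BicycleGAN paper linked above adds extra objectives precisely to prevent this kind of collapse, where the generator learns to ignore z.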
@phillipi Thank you so much for the kind reply!
@phillipi Excuse me, I have one question regarding this issue. Based on your statement, "Without noise, the mapping is deterministic, but that's often fine," what I understood is that the prior z is only useful if we need some sort of variety in the generated samples. However, if we just need to learn a direct mapping between two paired domains (e.g., an image and its semantic label map), then it is sufficient to ignore z. Is this correct? If this intuition is true, why isn't it sufficient for CycleGAN, where you enforced cycle consistency for the generated signals? Many thanks in advance.
@ahmed-fau Yeah, you only need z if you want the translation function to output multiple possibilities for each input. I'm not sure I understand the question about CycleGAN. In CycleGAN, we don't use z.
@phillipi I mean that the idea of CycleGAN is more or less similar to image-to-image translation, except that in CycleGAN the mapping is bidirectional. So if we are interested in only a unidirectional mapping, then both are similar (according to my understanding). Is there any difference between them in terms of latent-space mapping?
Hi, thanks for sharing, but I have a doubt about the deterministic mapping: if pix2pix without dropout outperforms a U-Net-based autoencoder, can we think of it this way: pix2pix without dropout is better because of the powerful discriminator loss? Is my understanding correct? Thanks a lot!
Yeah, the dropout doesn't really matter much for performance. It has a very minor effect. One could argue about whether or not deterministic mappings count as "generative models". It's true that they don't model a distribution of outputs; instead they just give a single guess.
Yep, I think that's a good way to think about it.
Thanks for sharing this great work.

Conditional GANs (cGANs) learn a mapping from an observed image x and a random noise vector z to y: y = f(x, z). I am wondering how z is augmented on the input x for the generator. In the code, x is passed to the generator's forward method as self.fake_B = self.netG(self.real_A), and in the forward method there is no z.
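To illustrate the situation the question describes, here is a minimal sketch. The tiny network below is a made-up stand-in for netG (the real one is a deep U-Net), but it shows the same structure: the forward call takes only the conditioning image, and the only source of randomness is dropout inside the network, which pix2pix keeps active even at test time.

```python
import torch
import torch.nn as nn

# Hypothetical toy stand-in for netG (layer sizes are made up):
# the forward call takes only the conditioning image, mirroring
#     self.fake_B = self.netG(self.real_A)
# and the sole source of randomness is dropout inside the network.
netG = nn.Sequential(
    nn.Conv2d(3, 64, 3, padding=1),
    nn.ReLU(),
    nn.Dropout2d(0.5),   # dropout plays the role of the implicit "noise"
    nn.Conv2d(64, 3, 3, padding=1),
)

real_A = torch.randn(1, 3, 32, 32)

netG.train()   # keep dropout active, as pix2pix does even at test time
fake_B1 = netG(real_A)
fake_B2 = netG(real_A)
# with dropout active, repeated calls on the same input differ slightly

netG.eval()    # with dropout disabled, the mapping x -> y is deterministic
```

This matches the discussion above: with dropout disabled the generator is a deterministic function of the input, and conditioning on a sufficiently complex input often makes that acceptable.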