About baseline of reward function #45

jackyyeh5111 · 2018-03-31T08:40:26Z

hello everyone,
I have learned that in order to reduce the variance of gradient estimator,
usually we apply the "reward baseline" technique in the gradient optimization function like

However, I cannot find any reward baseline technique in SeqGAN code.
Am I missing something?

thanks in advance!

TobiasLee · 2018-06-02T11:18:57Z

The code doesn't have this baseline trick. You can try it and evaluate it yourself

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

About baseline of reward function #45

About baseline of reward function #45

jackyyeh5111 commented Mar 31, 2018

TobiasLee commented Jun 2, 2018

About baseline of reward function #45

About baseline of reward function #45

Comments

jackyyeh5111 commented Mar 31, 2018

TobiasLee commented Jun 2, 2018