You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
hello everyone,
I have learned that in order to reduce the variance of gradient estimator,
usually we apply the "reward baseline" technique in the gradient optimization function like
However, I cannot find any reward baseline technique in SeqGAN code.
Am I missing something?
thanks in advance!
The text was updated successfully, but these errors were encountered:
hello everyone,
I have learned that in order to reduce the variance of gradient estimator,
usually we apply the "reward baseline" technique in the gradient optimization function like
However, I cannot find any reward baseline technique in SeqGAN code.
Am I missing something?
thanks in advance!
The text was updated successfully, but these errors were encountered: