`nothing` does not correspond to updating the state with a zero gradient #140
As mentioned in #137 (comment), when a `nothing` gradient is encountered, the `apply!` rule is not called at all and the state is not updated. So these two calls give different results. In the same discussion @mcabbott said
but I think these examples should correspond to the `opt_tree` only having part of the model, or to using different trees for the discriminator and the generator.
So in this issue I argue that we should treat `nothing` as semantically equivalent to a zero gradient, and define another type, e.g. `NoUpdate`, to signal that the `apply!` rule should not be called at all (so no momentum updates etc.).