Question regarding loss calculation #2

kwonmha · 2017-08-02T07:27:59Z

For the augmented model, you added alpha (0.5) * temperature (10) * augmented_loss to the ordinary loss.
How did you choose alpha and temperature?
And why did you multiply the augmented loss by the temperature?
That is not shown in the paper.
Also, have you tested using only the augmented loss, without adding it to the ordinary loss?
I don't think that is explicitly mentioned in the paper.
Thank you.

icoxfog417 · 2017-08-02T09:07:58Z

The augmented loss is defined in the paper (p. 2, formula 3.3).

So we have to decide the parameter α. The author describes how to compute α in the Appendix (p. 12).

α = γτ, which is what I implemented.
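A minimal sketch of how the weighting α = γτ enters the total loss, assuming γ = 0.5 and τ = 10 (the values quoted in the question above). The function and argument names are hypothetical and are not taken from the repository:

```python
def total_loss(ce_loss, aug_loss, gamma=0.5, temperature=10.0):
    """Combine the ordinary loss with the weighted augmented loss.

    gamma and temperature default to the values quoted in the question;
    all names here are illustrative assumptions.
    """
    alpha = gamma * temperature  # alpha = gamma * tau
    return ce_loss + alpha * aug_loss


# With gamma = 0.5 and tau = 10, the augmented loss is weighted by alpha = 5.
loss = total_loss(ce_loss=2.0, aug_loss=0.1)
```

This is why the temperature appears as a multiplier on the augmented loss: it is part of α rather than a separate term.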

kwonmha · 2017-08-02T10:48:02Z

Thank you for the answer.
I may have overlooked the appendix.

kwonmha · 2017-08-02T13:13:27Z

Sorry, but I have another question, just to make certain.
How should I get the output of the network for validation or testing?
Is it softmax( (Wh+b)/t )?

Also, I think you didn't divide Wh+b by the temperature when calculating the cross entropy, as in eq. (3.1) of the paper.

kwonmha · 2017-08-03T06:11:26Z

You can ignore the last two sentences of my previous comment.
I got confused.
Regarding the output of the network for validation or testing, it can be softmax(Wh+b), right?

icoxfog417 · 2017-08-04T09:44:14Z

I think softmax( (Wh+b)/t ) is used only to calculate the augmented loss.
So the network output will be softmax(Wh+b).
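To illustrate the distinction, here is a small pure-Python sketch of a softmax with an optional temperature. The logits are made-up values standing in for Wh+b, and the helper is an illustration, not the repository's code:

```python
import math


def softmax(logits, temperature=1.0):
    """Softmax over a list of logits, optionally divided by a temperature."""
    scaled = [z / temperature for z in logits]
    m = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(z - m) for z in scaled]
    total = sum(exps)
    return [e / total for e in exps]


logits = [2.0, 1.0, 0.1]  # hypothetical values of Wh+b

# Network output for validation or testing: no temperature.
prediction = softmax(logits)

# Softened distribution, used only inside the augmented loss (t = 10 here).
softened = softmax(logits, temperature=10.0)
```

Dividing by a temperature greater than 1 flattens the distribution but keeps the ranking of the classes, so the most likely class is the same in both cases.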

kwonmha · 2017-08-05T03:33:53Z

I think so too.
Thank you!

kwonmha · 2017-08-11T08:21:42Z

It's a minor glitch, but in formulation.png it looks like you used softmax(Wh/t) to calculate both CE and KL,
which may not be the case.

kwonmha closed this as completed Aug 5, 2017

drauh mentioned this issue Aug 7, 2017

Merge PR #2 and #3 and add compatibility with Theano and CNTK backend #5

Open
