Remove bias in augmented loss + fix perplexity #3

drauh · 2017-08-07T16:12:33Z

removing bias in augmented loss calculation (needed to add a Bias layer for the weight tying model)
now the last layer of the augmented model is a softmax. We use the log to recover pre-activation
fix categorical crossentropy calculation for perplexity and augmented loss, it was the opposite ordering
fixing running average perplexity calculation by using a hacky callback which exponentiated a running average of the cross-entropy. This replicate this implementation here (however because of the convexity this will only make its value bigger)
removing explicit tensorflow dependency

Everything is running well but I couldn't check the performance for more than 2 epochs since I don't have a GPU so that training is very slow...

Fix perplexity calculation From [there]{https://github.com/tensorflow/models/blob/3d792f935d652b2c7793b95aa3351a5551dc2401/tutorials/rnn/ptb/ptb_word_lm.py#L319} Fix categorical_crossentropy argument ordering * this was probably the main bug of the perplexity implementation, now it is much smaller Fix categorical_crossentropy argument ordering * Fix categorical_crossentropy argument ordering in augmented loss

drauh mentioned this pull request Aug 7, 2017

Merge PR #2 and #3 and add compatibility with Theano and CNTK backend #5

Open

Provide feedback