
Compare DeepSpeech (w/Dropout) vs DeepSpeech (w/o Dropout) + BatchNorm #373

Closed
kdavis-mozilla opened this issue Feb 11, 2017 · 6 comments

Comments

@kdavis-mozilla
Contributor

A bonus: no more optimizing dropout rates.

@ghost

ghost commented Apr 10, 2017

@kdavis-mozilla any progress on this? What is the reason behind the dropout of the current layer being (1 - dropout) of the previous layer?

@kdavis-mozilla
Contributor Author

The minus one has nothing to do with this issue. TensorFlow uses keep probabilities, not dropout rates, hence the minus one.

This issue asks how performance changes when dropout is exchanged for batch norm.
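
For illustration, a minimal TensorFlow 1.x sketch of the keep-probability point above; the dropout rate here is a placeholder value, not one of DeepSpeech's configured rates:

```python
# Minimal TF 1.x sketch: tf.nn.dropout takes a *keep* probability,
# so a configured dropout rate is passed as (1 - rate).
import tensorflow as tf

dropout_rate = 0.05                 # illustrative value only
keep_prob = 1.0 - dropout_rate      # hence the "minus one"

x = tf.placeholder(tf.float32, [None, 2048])
h = tf.nn.dropout(x, keep_prob=keep_prob)  # drops units with probability dropout_rate
```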

@andi4191
Contributor

andi4191 commented Jul 6, 2017

@reuben: I have a few doubts about this one.

  1. Is it expected to use the BatchNorm layer after every layer, or only after the h1, h2, h3 and h5 layers?
  2. If my understanding is correct, BatchNorm has to behave differently in the training and testing phases. For training, the mean and variance would be those of the current batch, whereas for testing they would be the moving averages accumulated during training. Hence, for the training phase an update_op needs to be added as a dependency so that the moving mean and variance are updated before every training step (referenced from the TensorFlow documentation). A sketch of this pattern follows this comment.

Please correct me if my understanding is incorrect.
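
For reference, a minimal TensorFlow 1.x sketch of the update_op dependency pattern described in point 2; it uses the generic tf.layers API, not the actual DeepSpeech model code, and the layer sizes are purely illustrative:

```python
# Minimal TF 1.x sketch (generic tf.layers API, not the DeepSpeech code):
# batch statistics at training time, moving averages at inference time,
# with the UPDATE_OPS dependency so the moving stats are refreshed each step.
import tensorflow as tf

x = tf.placeholder(tf.float32, [None, 494])        # illustrative feature size
labels = tf.placeholder(tf.int32, [None])
is_training = tf.placeholder(tf.bool, [])

h = tf.layers.dense(x, 2048)
# training=True -> normalize with the batch's mean/variance;
# training=False -> use the accumulated moving averages.
h = tf.layers.batch_normalization(h, training=is_training)
h = tf.nn.relu(h)
logits = tf.layers.dense(h, 29)

loss = tf.reduce_mean(
    tf.nn.sparse_softmax_cross_entropy_with_logits(labels=labels, logits=logits))

# The moving mean/variance updates live in the UPDATE_OPS collection,
# so the train step must depend on them.
update_ops = tf.get_collection(tf.GraphKeys.UPDATE_OPS)
with tf.control_dependencies(update_ops):
    train_op = tf.train.AdamOptimizer(1e-4).minimize(loss)
```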

@reuben
Contributor

reuben commented Jul 7, 2017

  1. We've seen approaches using BN after every hidden layer, but also approaches that use it only before layer-type changes (e.g. convolution -> fully connected). I guess the most direct comparison would be to replace all the cases of dropout with BatchNorm.
  2. Yes, your understanding is correct. TensorFlow's BatchNorm is sometimes tricky to get right. I'm currently experimenting with defining the model using Keras, which has its own implementation of BatchNorm and is probably easier to get right. I'll let you know what I find. (A Keras sketch of the dropout-for-BatchNorm swap follows this comment.)
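
For illustration, a minimal standalone-Keras sketch of replacing dropout with BatchNorm on a single hidden layer; the layer sizes and dropout rate are illustrative, not DeepSpeech's actual configuration:

```python
# Minimal standalone-Keras sketch: the same hidden layer with dropout vs.
# with batch normalization. Sizes are illustrative, not DeepSpeech's.
from keras.models import Sequential
from keras.layers import Dense, Dropout, Activation, BatchNormalization

# Dropout variant
with_dropout = Sequential([
    Dense(2048, input_shape=(494,)),
    Activation('relu'),
    Dropout(0.05),
])

# BatchNorm variant: Keras tracks the moving mean/variance itself and
# switches between batch and moving statistics for train vs. inference.
with_batchnorm = Sequential([
    Dense(2048, input_shape=(494,)),
    BatchNormalization(),
    Activation('relu'),
])
```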

andi4191 added a commit to andi4191/DeepSpeech that referenced this issue Aug 4, 2017
@kdavis-mozilla
Contributor Author

Closing for lack of activity.

@lock

lock bot commented Feb 9, 2020

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

@lock lock bot locked and limited conversation to collaborators Feb 9, 2020