First of all, thank you for sharing the PyTorch implementation, it's wonderful.
I've been going over the code and found this line:
binary_weights = binary_weights_no_grad.detach() - cliped_weights.detach() + cliped_weights
in birealnet.py, and I was wondering what its purpose is.
My best guess is that it merely allows gradients to flow without actually changing the values of the binary weights, but some clarification would be wonderful!
The purpose is that the "forward" value is the binarized weight (binary_weights_no_grad), while the value used for obtaining the gradient (the "backward" value) is the clamped weight (cliped_weights).
The same trick is used in binary_activation: a binarized value for the forward pass, and the differentiable approximation for the backward pass.
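A minimal sketch of how that line behaves (variable names and the scaling-factor computation are simplified assumptions, not copied from birealnet.py): subtracting the detached clamp and adding it back leaves the forward value equal to the binarized weight, while the backward pass sees only the clamp.

```python
import torch

# Real-valued latent weights (fixed values so the gradient check is deterministic).
w = torch.tensor([0.5, -2.0, 1.5, -0.3], requires_grad=True)

cliped_weights = torch.clamp(w, -1.0, 1.0)       # "backward" value: clamp has gradient 1 inside [-1, 1]
scale = w.abs().mean().detach()                  # hypothetical scaling factor, detached
binary_weights_no_grad = scale * torch.sign(w)   # "forward" value: sign(w) * scale, no useful gradient

# The straight-through line from birealnet.py:
binary_weights = binary_weights_no_grad.detach() - cliped_weights.detach() + cliped_weights

# Forward: the detached terms cancel cliped_weights exactly,
# so the value is the binarized weight.
assert torch.allclose(binary_weights.detach(), binary_weights_no_grad.detach())

# Backward: gradients flow only through the undetached cliped_weights,
# i.e. d(binary_weights)/dw is 1 where |w| < 1 and 0 where |w| > 1.
binary_weights.sum().backward()
print(w.grad)  # tensor([1., 0., 0., 1.])
```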
binary_weights_no_grad is a floating-point tensor, sign(w) * scale. After training is done, how can it be converted to a binary weight? I naively tried just using sign(w), without positive results: after applying sign(w) to the trained weights, the network no longer worked.
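One plausible explanation, sketched below under the assumption of a per-output-channel scale (I have not verified this against the repo's export path): sign(w) alone discards the learned scale, so every weight's magnitude changes. Storing the 1-bit signs together with the float scale, and multiplying them back at inference, reproduces the trained forward value exactly.

```python
import torch

# Hypothetical conv weights: 8 output channels of shape 3x3x3.
w = torch.randn(8, 3, 3, 3)

# Assumed per-output-channel scaling factor (mean of |w| over each filter).
scale = w.abs().mean(dim=(1, 2, 3), keepdim=True)

# What the network actually trained with as its forward weight:
binary_forward = scale * torch.sign(w)

# A "truly binary" export: 1-bit signs plus one float per channel.
signs = torch.sign(w)                 # values in {-1, 0, +1}; storable as bits
reconstructed = signs * scale         # applied again at inference time

# The reconstruction matches the trained forward weights exactly,
# whereas plain sign(w) has the right signs but the wrong magnitudes.
assert torch.allclose(reconstructed, binary_forward)
assert not torch.allclose(signs, binary_forward)
```

So the weights can still be stored in binary form; the per-channel scale just has to travel with them (or be folded into the following BatchNorm), rather than being dropped.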