Assignments for COS598D: System and Machine Learning

Binarized Neural Networks: XNOR-Net

In this assignment you will learn how to implement the gradient calculation of XNOR-Net [1] (https://github.com/allenai/XNOR-Net).

[1] Rastegari, M., Ordonez, V., Redmon, J., Farhadi, A.: XNOR-Net: ImageNet classification using binary convolutional neural networks. In: European Conference on Computer Vision (ECCV), pp. 525-542. Springer, Cham (2016)

Your Tasks

1. Read the Notes

Gradients of scaled sign function

In the paper, with the scaled binary weight W̃_i = α·sign(W_i) and α = (1/n)·‖W‖_1, the backward gradient through the scaled sign function is given as

    ∂C/∂W_i = ∂C/∂W̃_i · (1/n + α · ∂sign(W_i)/∂W_i)

However, this equation is actually inaccurate: α itself depends on every entry of W, so its derivative contributes to each ∂C/∂W_i. The correct backward gradient should be

    ∂C/∂W_i = α · ∂C/∂W̃_i · ∂sign(W_i)/∂W_i + (1/n) · sign(W_i) · Σ_j ∂C/∂W̃_j · sign(W_j)

Details about this correction can be found in the notes (section 1).
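
As a sanity check, the α-dependence term can be verified numerically. The sketch below is illustrative (names are ours, not from the assignment code): away from W_i = 0 the sign factor is locally constant, so a finite-difference gradient of C(W) = g·W̃(W) sees exactly the term the paper's formula replaces with a bare 1/n.

```python
import numpy as np

def scaled_sign(w):
    # W_tilde = alpha * sign(W), alpha = ||W||_1 / n
    return (np.abs(w).sum() / w.size) * np.sign(w)

rng = np.random.default_rng(0)
# keep |w_i| well away from 0 so the finite difference never crosses a kink
w = rng.uniform(0.1, 0.9, size=8) * rng.choice([-1.0, 1.0], size=8)
g = rng.normal(size=8)

eps = 1e-6
eye = np.eye(w.size)
numeric = np.array([
    (g @ scaled_sign(w + eps * eye[i]) - g @ scaled_sign(w - eps * eye[i]))
    / (2 * eps)
    for i in range(w.size)
])

corrected = np.sign(w) * (g @ np.sign(w)) / w.size  # alpha term from the notes
paper = g / w.size                                  # alpha term from the paper

print(np.max(np.abs(numeric - corrected)))  # small: formulas agree
print(np.max(np.abs(numeric - paper)))      # not small
```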

2. Implement the updateBinaryGradWeight function in util.py

Please follow the notes to implement the weight-gradient calculation.

You will implement updateBinaryGradWeight in both MNIST/util.py and CIFAR_10/util.py.
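
To make the shapes concrete, here is a minimal NumPy sketch of the computation the notes describe, for a conv weight of shape (out_ch, in_ch, kh, kw) with one α per output filter. This is our illustration, not the repo's PyTorch solution, and the repo's version may apply additional scaling factors on top of it.

```python
import numpy as np

def binary_grad_weight(w, grad):
    out_ch = w.shape[0]
    n = w[0].size                              # elements per filter
    w2 = w.reshape(out_ch, n)
    g2 = grad.reshape(out_ch, n)
    alpha = np.abs(w2).sum(axis=1, keepdims=True) / n
    ste = (np.abs(w2) <= 1.0).astype(w.dtype)  # straight-through sign'
    term1 = alpha * g2 * ste                   # alpha * dC/dW~ * dsign/dW
    term2 = np.sign(w2) * (g2 * np.sign(w2)).sum(axis=1, keepdims=True) / n
    return (term1 + term2).reshape(w.shape)

rng = np.random.default_rng(1)
w = rng.normal(scale=0.3, size=(4, 3, 5, 5))
g = rng.normal(size=w.shape)
new_grad = binary_grad_weight(w, g)
print(new_grad.shape)  # (4, 3, 5, 5)
```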

3. Report results and discussion

3.1 Fill in the results tables

You need to evaluate the efficiency and accuracy of binary convolution vs. standard (floating-point) convolution.
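
The source of binary convolution's efficiency is that a dot product of {-1, +1} vectors packed into machine words reduces to XNOR plus popcount. A conceptual pure-Python illustration (real implementations use bit-packed SIMD popcount instructions):

```python
def binary_dot(a_bits, b_bits, n):
    """a_bits, b_bits: ints whose low n bits encode +1 as 1 and -1 as 0."""
    xnor = ~(a_bits ^ b_bits) & ((1 << n) - 1)  # 1 where the signs match
    matches = bin(xnor).count("1")              # popcount
    return 2 * matches - n                      # matches - (n - matches)

# (+1, +1, -1, +1) . (+1, -1, +1, +1) = 1 - 1 - 1 + 1 = 0  (bit 0 first)
print(binary_dot(0b1011, 0b1101, 4))  # 0
```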

You will perform the evaluation on the MNIST and CIFAR-10 datasets, using LeNet-5 for MNIST and Network in Network (NIN) [2] for CIFAR-10. The MNIST experiment is in the MNIST folder; the CIFAR-10 experiment is in the CIFAR_10 folder. Below is an example of how to run MNIST with LeNet-5.

How to run

To run the training on MNIST with LeNet-5 using the XNOR-Net scheme:

$ cd <Repository Root>/MNIST/
$ python main.py

To run the training on MNIST with LeNet-5 using the vanilla (floating-point) scheme:

$ cd <Repository Root>/MNIST/
$ python main_vanilla.py

You can validate your result with the pretrained model, which can be downloaded here. To evaluate the pretrained model:

$ cp <Pretrained Model> <Repository Root>/MNIST/models/
$ python main.py --pretrained models/LeNet_5.best.pth.tar --evaluate

For CIFAR-10 we use Network in Network (NIN) [2].

[2] Lin, M., Chen, Q., Yan, S.: Network in network. arXiv preprint arXiv:1312.4400 (2013)

To run the training on CIFAR-10 with NIN using the XNOR-Net scheme:

$ cd <Repository Root>/CIFAR_10/
$ python main.py

To run the training on CIFAR-10 with NIN using the vanilla (floating-point) scheme:

$ cd <Repository Root>/CIFAR_10/
$ python main_vanilla.py

Compare Model Accuracy

| Dataset  | Network | Vanilla Net (floating-point) | XNOR-Net |
| -------- | ------- | ---------------------------- | -------- |
| MNIST    | LeNet-5 |                              |          |
| CIFAR-10 | NIN     |                              |          |

Compare Memory Consumption for the Trained Model

| Dataset  | Network | Vanilla Net (floating-point) | XNOR-Net |
| -------- | ------- | ---------------------------- | -------- |
| MNIST    | LeNet-5 |                              |          |
| CIFAR-10 | NIN     |                              |          |

Compare Model Training Time

| Dataset  | Network | Vanilla Net (floating-point) | XNOR-Net |
| -------- | ------- | ---------------------------- | -------- |
| MNIST    | LeNet-5 |                              |          |
| CIFAR-10 | NIN     |                              |          |

Compare Model Inference Time

Please save your final model after the whole training procedure and measure the inference time on the final model.

| Dataset  | Network | Vanilla Net (floating-point) | XNOR-Net |
| -------- | ------- | ---------------------------- | -------- |
| MNIST    | LeNet-5 |                              |          |
| CIFAR-10 | NIN     |                              |          |

To track the running time, you can use timeit, which is part of the Python standard library (no installation is needed):

import timeit

start = timeit.default_timer()

# ... the code whose running time you want to measure ...

stop = timeit.default_timer()

print('Time: ', stop - start)
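
A single measurement is noisy; a slightly more robust pattern is to warm up first and then average over several runs. In this sketch `fn` is a placeholder for your model's forward pass:

```python
import timeit

def bench(fn, warmup=3, repeats=10):
    """Return mean wall-clock seconds per call of fn()."""
    for _ in range(warmup):            # warm caches (and the GPU, if any)
        fn()
    start = timeit.default_timer()
    for _ in range(repeats):
        fn()
    return (timeit.default_timer() - start) / repeats

# cheap stand-in workload for illustration
print(bench(lambda: sum(range(10000))))
```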

3.2 Hyper-parameters

Please show how varying the number of channels and the filter size affects the inference speedup. Refer to Fig. 4(b-c) in the original paper [1].

For simplicity, we conduct this experiment on MNIST with LeNet-5 only. Specifically:

Case 1: Fix all filter sizes at their default values and change the number of channels of the self.bin_conv2 layer in LeNet_5. The default number of channels is k = 50. Record the speedup for k \in {10, 30, 50, 128, 512, 1024}.

Case 2: Fix all channel counts at their default values and change the filter size of the self.bin_conv2 layer in LeNet_5. The default filter size is s = 5. Record the speedup for s \in {1, 2, 3, 4, 5, 6, 7, 8}.

Please change the number of input features of self.ip1 correspondingly.
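
The reason self.ip1 must change: the flattened input to the first fully connected layer depends on conv2's channel count and filter size. Assuming the Caffe-style LeNet-5 layout (valid 5x5 conv1, stride 1, 2x2 max pools) on a 28x28 MNIST image, the arithmetic can be sketched as follows (the helper name is ours):

```python
def fc_in_features(k, s, img=28):
    """Input features of ip1 when bin_conv2 has k channels and s x s filters."""
    conv = lambda x, ksize: x - ksize + 1   # valid convolution, stride 1
    pool = lambda x: x // 2                 # 2x2 max pool, stride 2
    h = pool(conv(pool(conv(img, 5)), s))   # conv1 + pool, conv2 + pool
    return k * h * h

print(fc_in_features(50, 5))   # default LeNet_5: 50 * 4 * 4 = 800
```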

3.3 Discuss your results

Please describe the settings of your experiments and include the required results from 3.1 and 3.2. Add captions describing your figures and tables. Analyze the advantages and limitations of XNOR-Net, and write brief discussions of your results: the patterns you observe (what and why), your conclusions, and any other observations you want to raise.
