caffe-googlenet-bn

This model is a re-implementation of Batch Normalization publication, and the model is trained with a customized caffe; however, the modifications are minor. Thus, you can run this with the currently available official caffe version, including cudnn v4 support and multigpu support.

The network definition and solver prototxt files are modified from https://github.com/BVLC/caffe/tree/master/models/bvlc_googlenet

Notes:

training with random crop;
training without any data-augmentation except random crop;
uses "xavier" to initialize the weights;
training with real-time shuffle with a modified data_reader.cpp;
batch normalization layer is modified version of https://github.com/ChenglongChen/batch_normalization. The modified bn layer supports batch normalization for inference. (See neuron_layers.hpp, bn_layer.cpp, and bn_layer.cu)
the official batch normalization layer is used and the usage of it is adopted from https://github.com/KaimingHe/deep-residual-networks.
use test_bn.cpp and predict_bn.cpp for inference.
use a mini-batch of 64 on 2 GPUs, i.e. 32 per GPU
use ILSVRC2015 labels, instead of 12' label.
Data (images) are resized with 256 x 256 convert_imageset.cpp.

The uploaded caffemodel is the snapshot of 1,200,000 iteration (30 epochs) using solver_stepsize_6400.prototxt

The uploaded model achieves a top-1 accuracy 72.05% (27.95% error) and a top-5 accuracy 90.87% (9.13% error) on the validation set, using a single center crop.

Thank John Lee for helping me training this model.

Tips for performance

Real-time data shuffling is important
Data augmentation during training should improve the accuracy.
Change interpolation method (default is bilinear) of opencv to bicubic when you convert image will give you minor improvement.

To-do

Data augmentation

References

[1] http://arxiv.org/abs/1409.4842

[2] http://arxiv.org/abs/1502.03167

License

This model is released for unrestricted use.

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
snapshots		snapshots
tools		tools
.gitignore		.gitignore
README.md		README.md
deploy.prototxt		deploy.prototxt
plot_log.sh		plot_log.sh
predict_bn.sh		predict_bn.sh
solver_stepsize_12800.prototxt		solver_stepsize_12800.prototxt
solver_stepsize_6400.log.20160205-191806.31809		solver_stepsize_6400.log.20160205-191806.31809
solver_stepsize_6400.log.png		solver_stepsize_6400.log.png
solver_stepsize_6400.prototxt		solver_stepsize_6400.prototxt
test_bn.sh		test_bn.sh
train_val.prototxt		train_val.prototxt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

caffe-googlenet-bn

Tips for performance

To-do

References

License

About

Releases

Packages

Languages

lim0606/caffe-googlenet-bn

Folders and files

Latest commit

History

Repository files navigation

caffe-googlenet-bn

Tips for performance

To-do

References

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages