
padding aware im2col and col2im functions #128

Merged: 8 commits merged into BVLC:dev on Feb 25, 2014

Conversation

mavenlin
Contributor

This PR is to replace #99

@mavenlin
Contributor Author

BTW, I wonder how net_speed_benchmark gets each forward/backward time under GPU mode. Aren't the kernel calls asynchronous? I added cuda_timer under util to measure kernel execution time.

@shelhamer
Member

Thanks for the rebased PR. I'm fine with keeping both the original and padding-aware versions for the reasons you gave:

1. the original im2col is faster when pad=0;
2. backward compatibility: if a model definition file uses a padding layer, it won't lose performance;

at least for now, but I defer to @Yangqing's judgement.
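
For context, a minimal sketch of what a padding-aware CPU im2col can look like (illustrative only, not necessarily the exact code merged here). The per-element bounds check is the overhead that makes the original version faster when pad=0; with pad=0 the check never fails, so the result matches the unpadded behavior:

```cpp
// Sketch of a padding-aware im2col: lays out ksize x ksize patches of a
// channels x height x width image into columns, treating out-of-bounds
// (padded) locations as zeros. Names and argument order are illustrative.
template <typename Dtype>
void im2col_cpu(const Dtype* data_im, const int channels,
    const int height, const int width, const int ksize, const int pad,
    const int stride, Dtype* data_col) {
  const int height_col = (height + 2 * pad - ksize) / stride + 1;
  const int width_col = (width + 2 * pad - ksize) / stride + 1;
  const int channels_col = channels * ksize * ksize;
  for (int c = 0; c < channels_col; ++c) {
    const int w_offset = c % ksize;
    const int h_offset = (c / ksize) % ksize;
    const int c_im = c / ksize / ksize;
    for (int h = 0; h < height_col; ++h) {
      for (int w = 0; w < width_col; ++w) {
        const int h_im = h * stride - pad + h_offset;
        const int w_im = w * stride - pad + w_offset;
        // This bounds check is what the original (pad-free) version avoids;
        // with pad = 0 it never fails but still costs a branch per element.
        data_col[(c * height_col + h) * width_col + w] =
            (h_im >= 0 && h_im < height && w_im >= 0 && w_im < width)
            ? data_im[(c_im * height + h_im) * width + w_im]
            : Dtype(0);
      }
    }
  }
}
```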

@sguada
Contributor

sguada commented Feb 20, 2014

@mavenlin net_speed_benchmark computes the GPU time by doing several forward/backward calls to each layer in order. It will be clearer if you look at the code: https://github.com/BVLC/caffe/blob/master/examples/net_speed_benchmark.cpp#L72

@mavenlin
Contributor Author

@sguada But I think all the kernel calls will be issued at once; the CPU code will move on without waiting for them to return.

@mavenlin
Contributor Author

@kloudkl I tried the cudaDeviceSynchronize approach, but it doesn't seem to work; the results are weird. That's why I implemented the GPU timer approach and packed it into the CudaTimer class under util. But I didn't see cudaDeviceSynchronize in the forward/backward functions, which is why I suspect the benchmark code won't work on the GPU. Please tell me if I'm wrong.
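
For readers following the timing discussion: kernel launches are asynchronous, so a plain CPU timer around Forward/Backward measures mostly launch time. A CUDA-event-based timer in the spirit of the CudaTimer mentioned above might look roughly like this (class and method names are illustrative, not the PR's actual API):

```cpp
#include <cuda_runtime.h>

// Minimal sketch of an event-based GPU timer. cudaEventSynchronize() blocks
// until the stop event has been reached on the device, so asynchronously
// launched kernels are fully counted.
class GpuTimer {
 public:
  GpuTimer() {
    cudaEventCreate(&start_);
    cudaEventCreate(&stop_);
  }
  ~GpuTimer() {
    cudaEventDestroy(start_);
    cudaEventDestroy(stop_);
  }
  void Start() { cudaEventRecord(start_, 0); }
  void Stop() { cudaEventRecord(stop_, 0); }
  float MilliSeconds() {
    cudaEventSynchronize(stop_);
    float ms = 0;
    cudaEventElapsedTime(&ms, start_, stop_);
    return ms;
  }
 private:
  cudaEvent_t start_, stop_;
};

// Usage sketch:
//   GpuTimer timer;
//   timer.Start();
//   /* launch kernels, e.g. layer->Forward(...) */
//   timer.Stop();
//   float elapsed_ms = timer.MilliSeconds();
```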

@sguada
Contributor

sguada commented Feb 21, 2014

@shelhamer
Member

@sguada I'm actually not that into the article. Much of their story is fixing the problems they gave themselves by running in virtualization. To me, Caffe scores a victory by achieving excellent performance without mucking about in drivers and kernel code.

The model search / hyperparameter optimization work by Bergstra and Adams, using Bayesian optimization and random search, is interesting in its own right though [1, 2, 3]. It could be a better way than humans turning knobs, but "graduate student descent" for conditioning has been the best performer so far.

[1] J. Snoek, H. Larochelle, and R. P. Adams. Practical Bayesian Optimization of Machine Learning Algorithms. In NIPS, 2012.
[2] J. Bergstra, D. Yamins, and D. D. Cox. Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. In Proc. 30th International Conference on Machine Learning (ICML), 2013.
[3] Hyperopt, by J. Bergstra.

@shelhamer
Member

@mavenlin we have discussed this and decided padding-aware is always the right thing to do. Please refactor to use the padding-aware version everywhere, remove the non-padding-aware version, and drop the obsolete padding layer; then we will happily merge! Thanks again.

@sergeyk
Contributor

sergeyk commented Feb 25, 2014

@forresti should benefit from this change as well; please chime in if you have any concerns!

@forresti
Contributor

Very nice -- sounds useful. :)

@mavenlin
Contributor Author

@shelhamer a few questions:

  1. Do we remove the padding layer from the code, or just leave it there but mark it as obsolete?
  2. Are we keeping the im2col function as well as the padded_im2col function, or just replacing im2col?

I think I'll keep the name im2col, add a pad parameter to that function, and remove padded_im2col, as the name padded_xxx is quite ugly.
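
Under that plan, the unified declarations would presumably end up looking roughly like this (argument names and order are an assumption on my part; pad = 0 recovers the old unpadded behavior):

```cpp
// Hypothetical unified interface: padded_im2col/padded_col2im go away and
// the original names gain an explicit pad argument.
template <typename Dtype>
void im2col_cpu(const Dtype* data_im, const int channels,
    const int height, const int width, const int ksize, const int pad,
    const int stride, Dtype* data_col);

template <typename Dtype>
void col2im_cpu(const Dtype* data_col, const int channels,
    const int height, const int width, const int ksize, const int pad,
    const int stride, Dtype* data_im);
```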

@shelhamer
Member

  1. Remove the padding layer. It's completely unneeded now that we're padding aware, and keeping it around could only be confusing.
  2. We're replacing im2col (but keeping the name).

Note that the example prototxts and the reference model will need to be updated. This can be prepared and double-checked when this migrates from dev to master.

Thanks again for your work.

@mavenlin
Contributor Author

@shelhamer It is resolved and rebased on dev.
@kloudkl I removed the cuda_timer, which is no longer used; you may start a PR for the timer you wrote, which wraps both the CPU and GPU timers. I think it is necessary to have a convenient timer.

@shelhamer shelhamer self-assigned this Feb 25, 2014
shelhamer added a commit that referenced this pull request Feb 25, 2014
im2col and col2im learn to pad and padding layer is obsolete
@shelhamer shelhamer merged commit ae56141 into BVLC:dev Feb 25, 2014
@forresti
Contributor

Great work, all.

One question: How much faster is the original im2col() when pad=0?

@mavenlin
Contributor Author

@forresti You can refer to #99, the second table.

@forresti
Contributor

Ah, perfect. Looks good! Thanks for creating this, @mavenlin!

@kloudkl
Contributor

kloudkl commented Feb 25, 2014

@mavenlin was really insightful to spot the opportunities (this one and #119) to significantly optimize both memory usage and computation time!

shelhamer added a commit to shelhamer/caffe that referenced this pull request Feb 26, 2014
im2col and col2im learn to pad and padding layer is obsolete
shelhamer added a commit that referenced this pull request Feb 26, 2014
im2col and col2im learn to pad and padding layer is obsolete
shelhamer added a commit that referenced this pull request Feb 26, 2014
im2col and col2im learn to pad and padding layer is obsolete
@sguada
Contributor

sguada commented Feb 26, 2014

Sorry @mavenlin @shelhamer, I think we should leave the padding layer; otherwise it will break all the models I already have. Maybe we can think about removing it when we do the change in the protobuf and write the conversion script.

@shelhamer
Member

@sguada I don't think this should wait on that whole endeavour. This is a simple change to networks; perhaps we could write the first conversion script for this change and get a little practice for the great shift. However, we all have a lot happening right now, so I am going to keep this contribution in dev for the moment.

@mavenlin, your contribution will make it to master once models are updated. Thanks again!

@sguada
Contributor

sguada commented Feb 26, 2014

@shelhamer This is the error I get now with all my previous prototxts (~100) and pretrained models:
F0226 13:42:26.569927 1998293344 layer_factory.cpp:61] Unknown layer name: padding
*** Check failure stack trace: ***

In the future I think we should try to avoid removing old stuff and breaking Caffe with each PR.

We could mark it as DEPRECATED to indicate it will be removed in the future, and then remove it when we change the version and have a conversion tool.

Otherwise this interferes too much with development.

@shelhamer
Member

@sguada Yes, this does break current models. That is why this change is in dev and not master.

We have dev so we can work freely, experiment, and get the job done. Once changes survive and are proven, they will go to master. The whole point of this scheme is to keep master from breaking, so we agree on that.

You raise the good point that dev, while experimental, should try to be as unbroken as possible to not impede development. This means that deprecation is important not only in master but in dev too.

Luckily, this break is a relatively simple one: remove padding layers and add a padding field. We can fix this within dev, and even write a script to do it that will lay the groundwork for #169. That's why this will be done first rather than waiting on the much greater amount of work needed for the complete schema change.

Deprecation will be done from the beginning for changes relating to #169, so that a healthy level of order will be preserved in dev.

Thank you for bringing this up so we can improve the contributing workflow.
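
For concreteness, the break and its fix look roughly like this in a model definition: the standalone padding layer is removed and its pad amount moves onto the convolution layer that consumed it. (Layer and field names below follow the old v0 prototxt schema from memory and should be treated as approximate.)

```
# Before: an explicit padding layer feeding the convolution
layers {
  layer { name: "pad2" type: "padding" pad: 2 }
  bottom: "norm1"
  top: "pad2"
}
layers {
  layer { name: "conv2" type: "conv" num_output: 256 kernelsize: 5 }
  bottom: "pad2"
  top: "conv2"
}

# After: the padding layer is gone and conv2 pads for itself
layers {
  layer { name: "conv2" type: "conv" num_output: 256 kernelsize: 5 pad: 2 }
  bottom: "norm1"
  top: "conv2"
}
```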

@sguada
Contributor

sguada commented Feb 26, 2014

@shelhamer I agree with you, dev is there for that purpose, but still, once a PR is merged into dev, subsequent PRs depend on it. Any advice on how to handle that?

#169 is a great plan, and I'm totally for it, but maybe marking things as deprecated in dev would allow an easier transition.

Just opened a new issue for removing padding layers and adding a padding field: #170

@shelhamer
Member

@sguada Agreed: deprecation should be done from now on for merge into dev since further development depends on it. I should have recommended it for this merge.

#170 is a priority to smooth out dev.

shelhamer added a commit that referenced this pull request Feb 26, 2014
im2col and col2im learn to pad and padding layer is obsolete
@sguada
Contributor

sguada commented Feb 27, 2014

Congrats @mavenlin, this merge brings the memory requirements (#119) for training ImageNet down from 4167MB to 3475MB (updated), and from 3631MB to 3001MB (updated) when not using a test-net.

#119 (comment)

@shelhamer Well worth the change.

@Yangqing
Member

Kudos!


@shelhamer shelhamer mentioned this pull request Mar 18, 2014
mitmul pushed a commit to mitmul/caffe that referenced this pull request Sep 30, 2014
im2col and col2im learn to pad and padding layer is obsolete
coder-james pushed a commit to coder-james/caffe that referenced this pull request Nov 28, 2016
Fix windows build in appveyor by disabling parallel build