In-place computation can break gradient computation #2015
Note: you may want to coordinate with #1979, which fixes some bugs in MVNLayer.
According to @mfigurnov, cuDNN max pooling is also a layer that requires its top data during backward.
It's probably worth adding a mechanism to each layer that says whether it (a) does in-place computation and (b) can support the next layer doing in-place computation. The net could then check that all of the layers are compatible at startup.
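A minimal sketch of what such a mechanism might look like. This is not actual Caffe API: the method names (`ComputesInPlace`, `AllowsInPlaceConsumer`) and the `CheckInPlaceCompatibility` startup check are hypothetical illustrations of the proposal above.

```cpp
#include <cstdio>
#include <stdexcept>
#include <string>
#include <vector>

struct Layer {
  virtual ~Layer() {}
  virtual std::string name() const = 0;
  // (a) whether this layer writes its top in place over its bottom
  virtual bool ComputesInPlace() const { return false; }
  // (b) whether a consumer may overwrite this layer's top in place
  //     (false for layers whose Backward reads their own top data)
  virtual bool AllowsInPlaceConsumer() const { return true; }
};

struct MVNLayer : Layer {
  std::string name() const override { return "mvn"; }
  // Per this issue, MVN's Backward reads its top data, so the top
  // must survive unmodified until the backward pass.
  bool AllowsInPlaceConsumer() const override { return false; }
};

struct ReLULayer : Layer {
  std::string name() const override { return "relu"; }
  bool ComputesInPlace() const override { return true; }
};

// Hypothetical startup check: reject any producer that needs its top
// preserved feeding a consumer that runs in place on that same blob.
void CheckInPlaceCompatibility(const std::vector<Layer*>& chain) {
  for (size_t i = 0; i + 1 < chain.size(); ++i) {
    if (chain[i + 1]->ComputesInPlace() &&
        !chain[i]->AllowsInPlaceConsumer()) {
      throw std::runtime_error(
          "layer '" + chain[i + 1]->name() + "' runs in place but '" +
          chain[i]->name() + "' needs its top data for Backward");
    }
  }
}

int main() {
  MVNLayer mvn;
  ReLULayer relu;
  std::vector<Layer*> chain = {&mvn, &relu};
  try {
    CheckInPlaceCompatibility(chain);  // throws: incompatible pair
  } catch (const std::exception& e) {
    std::fprintf(stderr, "startup check failed: %s\n", e.what());
  }
  return 0;
}
```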
Further thoughts from Sean Bell in #2853:
For instance, MVNLayer reads data from its top blob during the backward pass, under the assumption that this data is exactly the same as the output it created. If it's been modified by a later layer that does in-place computation, the gradient will be computed incorrectly.
In general, caffe should not rely on the user to know under what circumstances a layer can safely be computed in-place.
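A standalone illustration of that failure mode (plain C++ with no Caffe dependency; the mean-subtraction and ReLU arithmetic here are deliberately simplified stand-ins for the real layers):

```cpp
#include <algorithm>
#include <cstdio>
#include <vector>

int main() {
  // "MVN-like" forward: subtract the mean, writing into a shared buffer.
  std::vector<float> buf = {1.0f, 2.0f, 3.0f, 6.0f};
  float mean = 0.0f;
  for (float v : buf) mean += v;
  mean /= buf.size();
  for (float& v : buf) v -= mean;  // buf now holds the MVN-style top data

  // A later layer runs in place (ReLU-style), overwriting the same buffer.
  for (float& v : buf) v = std::max(v, 0.0f);

  // If the first layer's backward now reads `buf` assuming it still holds
  // its own output, it sees stale data and computes a wrong gradient.
  // MVN output would be -2 -1 0 3; the buffer actually holds 0 0 0 3.
  for (float v : buf) std::printf("%g ", v);
  std::printf("\n");
  return 0;
}
```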