There are a couple of `max_pooling2d()` layers inside the attention block `sn_non_local_block_sim()` that reduce the number of spatial locations by a factor of 4, i.e. `downsampled_num = location_num // 4`. However, no such downsampling step is reported in the original paper.
Also, the first two `sn_conv1x1()` layers, which correspond to `Wg` and `Wf` in the paper, both have shape `C/8 x C`, but the third one, corresponding to `Wh`, has shape `C/2 x C`, whereas the paper specifies `C/8 x C` for it as well. The same applies to the last conv layer.
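To make the discrepancy concrete, here is a minimal NumPy sketch (not the repo's TensorFlow code) of the block as implemented, showing where the pooling and the questioned channel shapes appear. The names `w_f`/`w_g`/`w_h`/`w_o` and the plain residual connection are simplifying assumptions for illustration:

```python
import numpy as np

def non_local_block_sim(x, seed=0):
    """x: feature map of shape (H, W, C), H and W even."""
    H, W, C = x.shape
    rng = np.random.default_rng(seed)

    # 1x1 convs are per-location matrix multiplies over channels.
    w_f = rng.standard_normal((C, C // 8))  # Wf: C -> C/8
    w_g = rng.standard_normal((C, C // 8))  # Wg: C -> C/8
    w_h = rng.standard_normal((C, C // 2))  # Wh: C -> C/2 (the questioned shape)
    w_o = rng.standard_normal((C // 2, C))  # final 1x1 conv back to C

    location_num = H * W
    theta = x.reshape(location_num, C) @ w_f             # (HW, C/8)

    # 2x2 max-pooling on the key/value branches keeps 1 of every 4
    # locations, hence downsampled_num = location_num // 4.
    pooled = x.reshape(H // 2, 2, W // 2, 2, C).max(axis=(1, 3))
    downsampled_num = location_num // 4
    phi = pooled.reshape(downsampled_num, C) @ w_g       # (HW/4, C/8)
    g = pooled.reshape(downsampled_num, C) @ w_h         # (HW/4, C/2)

    attn = theta @ phi.T                                 # (HW, HW/4)
    attn = np.exp(attn - attn.max(axis=1, keepdims=True))
    attn /= attn.sum(axis=1, keepdims=True)              # softmax over keys

    out = (attn @ g) @ w_o                               # (HW, C)
    return x + out.reshape(H, W, C)                      # residual add

y = non_local_block_sim(np.ones((8, 8, 64)))
print(y.shape)  # (8, 8, 64)
```

Removing the two `max` pooling steps (attending over all `HW` keys) and setting `w_h` to shape `(C, C // 8)` would match the shapes given in the paper.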
Is there a reason for such discrepancies?
Related #8