RuntimeError: shape '[64, 49, 3, 3, 21]' is invalid for input of size 602112 #7

Riyone · 2022-07-12T00:54:40Z

Hello, I am trying to use only the models in "backbones/davit.py", with torch.randn((1,3,224,224)) as a test image input. When execution reaches "qkv = self.qkv(x).reshape(B_, N, 3, self.num_heads, C // self.num_heads).permute(2, 0, 3, 1, 4)" (class WindowAttention), I get the error in the title.

Is there any requirement for the image format, or is there anything else I missed?

Riyone · 2022-07-12T03:35:44Z

Alright, I guess it's because the initialized embed_dims are not evenly divisible by num_heads.
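
The numbers in the error message appear to support this. A minimal sketch of the arithmetic, assuming the first-stage channel count is 64 (inferred from 602112 / (64 * 49) = 192 = 3 * 64, not read from the repository) and num_heads is 3 (from the error shape). The helper name `qkv_reshape_sizes` is hypothetical:

```python
def qkv_reshape_sizes(num_windows, tokens_per_window, dim, num_heads):
    """Return (elements available, elements requested) for the qkv reshape."""
    # self.qkv(x) produces 3 * dim features per token.
    available = num_windows * tokens_per_window * 3 * dim
    # The reshape asks for 3 * num_heads * (dim // num_heads) features per token.
    head_dim = dim // num_heads  # integer division silently drops the remainder
    requested = num_windows * tokens_per_window * 3 * num_heads * head_dim
    return available, requested


# dim=64 is an assumption inferred from the error message; 64 // 3 == 21.
print(qkv_reshape_sizes(64, 49, 64, 3))  # (602112, 592704) -> mismatch
# dim=96 is divisible by 3, so both counts agree.
print(qkv_reshape_sizes(64, 49, 96, 3))  # (903168, 903168)
```

Because `C // self.num_heads` rounds down, the requested shape holds fewer elements than the tensor whenever the channel count is not a multiple of the head count, which is what the RuntimeError reports.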

dc250601 · 2022-07-16T05:02:36Z

We have to initialize embed_dims as (96, 192, 384, 768); this is the default mentioned in the paper. I don't know why the code has it initialized differently. Even the Swin paper uses the same config, so it feels more natural.
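
A quick way to check a configuration before building the model might look like the sketch below. The function name `check_head_divisibility` is hypothetical, and the num_heads tuple (3, 6, 12, 24) is an assumption; only the first value, 3, is visible in the error message:

```python
def check_head_divisibility(embed_dims, num_heads):
    """Return the (dim, heads) pairs where dim is not a multiple of heads."""
    return [(d, h) for d, h in zip(embed_dims, num_heads) if d % h != 0]


heads = (3, 6, 12, 24)  # assumed per-stage head counts, not taken from this thread

# The embed_dims suggested above pass the check at every stage.
print(check_head_divisibility((96, 192, 384, 768), heads))  # []

# A first-stage width of 64 (inferred from the error message) fails it.
print(check_head_divisibility((64, 192, 384, 768), heads))  # [(64, 3)]
```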
