Thanks for your excellent work!
I have a question about the pseudocode of LSDA, which is implemented in only ten lines of code using nothing but reshape and permute operations:
if type == "SDA":
    x = x.reshape(H // G, G, W // G, G, D).permute(0, 2, 1, 3, 4)
elif type == "LDA":
    x = x.reshape(G, H // G, G, W // G, D).permute(1, 3, 0, 2, 4)
The two branches clearly reshape the input differently, but I am still unsure about the reason behind this particular design. Could you explain, from another perspective, why these two reshape/permute patterns correspond to long-distance and short-distance attention, respectively?
Thanks a lot!
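One way to see the difference is to track which token positions land in the same attention group under each reshape. Below is a minimal sketch using NumPy (`transpose` in place of PyTorch's `permute`); the values of `H`, `W`, `G`, and `D` are hypothetical small numbers chosen to make the grouping visible:

```python
import numpy as np

H = W = 4   # feature-map height/width (hypothetical small values)
G = 2       # group size from the pseudocode
D = 1       # embedding dim, kept at 1 so each token is just its index

# Label every spatial position with a unique index so we can track it.
x = np.arange(H * W).reshape(H, W, D)

# SDA: split each axis as (H//G, G), so the fast-varying factor G indexes
# adjacent rows/cols -> each group is a contiguous G x G window.
sda = x.reshape(H // G, G, W // G, G, D).transpose(0, 2, 1, 3, 4)
sda_groups = sda.reshape(-1, G * G)   # each row = one group of G*G tokens

# LDA: split each axis as (G, H//G), so tokens in the same group share the
# inner index and sit a fixed stride (H//G) apart -> dilated sampling.
lda = x.reshape(G, H // G, G, W // G, D).transpose(1, 3, 0, 2, 4)
lda_groups = lda.reshape(-1, G * G)

print(sda_groups)
print(lda_groups)
```

For this 4x4 grid, SDA's first group is the contiguous 2x2 window [0, 1, 4, 5] (short-distance), while LDA's first group is the strided set [0, 2, 8, 10], i.e. tokens spaced H//G apart (long-distance).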