Regarding FLOP calculation in Local ViM #36

saarthakk-insitro · 2024-10-23T01:31:05Z

Hi Authors,

Thanks for making the code available, it was super helpful!!

I was trying to understand the FLOPs calculation and came across that you have commented "flops += 9 * L * D * N + 2 * D * L" and instead used different formula:

https://github.com/hunto/LocalMamba/blob/main/classification/lib/models/local_vim.py#L442

for i in range(len(layer.mixer.multi_scan.choices)):
# flops += 9 * L * D * N + 2 * D * L
# A
flops += D * L * N
# B
flops += D * L * N * 2
# C
flops += (D * N + D * N) * L
# D
flops += D * L
# Z
flops += D * L

Can you please help me understand the reasoning behind this change in FLOP calculation formula. With current formula, in ViM since we have 2 scan choices, the code gives 5.1 GFLOPs consistent with reported flops in your work. However according to authors of Mamba, they posted that selective scan should take 9LD*N as consistent with your commented part (state-spaces/mamba#110).

Thanks

saarthakk-insitro · 2024-10-23T04:52:36Z

Also I believe that here https://github.com/hunto/LocalMamba/blob/main/classification/lib/models/local_vim.py#L424:

In line 424, 426, and 428, the flops needs to be multiplied by 2 when calculating flops for ViM since it has 2 forward-backward layers having different discretization and causal conv1d steps. I just calculate with updated code and got 5.91 GFLOPs. Can you please confirm if this analysis is correct?

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Regarding FLOP calculation in Local ViM #36

Regarding FLOP calculation in Local ViM #36

saarthakk-insitro commented Oct 23, 2024 •

edited

Loading

saarthakk-insitro commented Oct 23, 2024

Regarding FLOP calculation in Local ViM #36

Regarding FLOP calculation in Local ViM #36

Comments

saarthakk-insitro commented Oct 23, 2024 • edited Loading

saarthakk-insitro commented Oct 23, 2024

saarthakk-insitro commented Oct 23, 2024 •

edited

Loading