dm_nfnet_f4 not working on rtx 3060ti #449
Replies: 3 comments 1 reply
-
@mobassir94 they're huge models, why would you expect anything different? The F0 is bigger than a ResNet-200. F4 is 320M params and almost 5x a ResNet-200! and running by default at a res > 2x the usual ResNet which is 4x the number of pixels (activations). |
Beta Was this translation helpful? Give feedback.
-
i can use f0,1,2(maximum) when using rtx 3060ti,but they are performing well with small batch size like 4, thank you for sharing those weight files,i will see if i can use b3 on torch xla tpu,thanks |
Beta Was this translation helpful? Give feedback.
-
thank you @rwightman |
Beta Was this translation helpful? Give feedback.
-
i tried to use dm_nfnet_f0 and with batch size 8 i could use 456 image size and on rtx 3060ti,however i tried to train dm_nfnet_f4 today on rtx 3060ti and even with batch size 1 and image size 32(i repeat, 32) i get OOM,why is this happeningÉ is there any memory leak or something? or i won't be able to use dm_nfnets>0 on rtx 3060ti which has 8 gb vram?trying dm_nfnet_f1,2,3 now
Beta Was this translation helpful? Give feedback.
All reactions