Pretrained smaller models availability #45

Closed
ganzobtn opened this issue Nov 27, 2023 · 2 comments
Comments

@ganzobtn

Hello. Thank you for the great work.
1. Could you provide the ViT-B and ViT-S models?
2. How much GPU VRAM is required to fine-tune the pretrained ViT-G model on a custom video dataset? When I try to fine-tune it with a batch size of 1 on a V100 with 32 GB of memory, I get a CUDA out-of-memory error. Is there something wrong with what I am doing?

@congee524
Collaborator

  1. vit_b_hybrid_pt_800e.pth
  2. We fine-tune ViT-g with batch_size=6 on 80 GB A100 GPUs. Please check your PyTorch version (the newer, the better), or you could use gradient checkpointing.
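
For readers unfamiliar with the checkpointing suggestion, here is a minimal sketch of activation (gradient) checkpointing using PyTorch's `torch.utils.checkpoint`. It is not this repository's code: `CheckpointedEncoder` is a hypothetical toy model built from stock `nn.TransformerEncoderLayer` blocks as a stand-in for a ViT backbone, and the `use_checkpoint` flag is an assumed name. The idea is that each block's intermediate activations are dropped during the forward pass and recomputed during backward, trading extra compute for lower memory use.

```python
import torch
import torch.nn as nn
from torch.utils.checkpoint import checkpoint


class CheckpointedEncoder(nn.Module):
    """Toy transformer encoder (hypothetical stand-in for a ViT backbone)."""

    def __init__(self, dim=64, depth=4, heads=4, use_checkpoint=True):
        super().__init__()
        # Dropout is disabled so checkpointed and plain runs are comparable.
        self.blocks = nn.ModuleList([
            nn.TransformerEncoderLayer(
                dim, heads, dim_feedforward=4 * dim,
                dropout=0.0, batch_first=True,
            )
            for _ in range(depth)
        ])
        self.use_checkpoint = use_checkpoint  # assumed flag name

    def forward(self, x):
        for blk in self.blocks:
            if self.use_checkpoint and self.training:
                # Do not store this block's activations; recompute them
                # when backward reaches this block.
                x = checkpoint(blk, x, use_reentrant=False)
            else:
                x = blk(x)
        return x


if __name__ == "__main__":
    model = CheckpointedEncoder().train()
    tokens = torch.randn(2, 8, 64)  # (batch, sequence, dim)
    loss = model(tokens).pow(2).mean()
    loss.backward()  # blocks are re-run here to rebuild activations
    print("loss:", loss.item())
```

Checkpointing does not change the computed gradients; it only changes when the activations are materialized. How much memory it saves depends on the model and input size, so whether it is enough to fit a given model on a 32 GB card would need to be measured.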

Hope it helps.

@mayarkamali

Hello, and thank you for your work.

Could you please share the ViT-B and ViT-S models again? The shared link no longer works.

Thank you in advance.
