Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How sensitive is this model to different batch size? #35

Open
yxchng opened this issue Jul 26, 2023 · 2 comments
Open

How sensitive is this model to different batch size? #35

yxchng opened this issue Jul 26, 2023 · 2 comments

Comments

@yxchng
Copy link

yxchng commented Jul 26, 2023

Will small batch size like 512 work? I only have 8 GPUs.

@LTH14
Copy link
Owner

LTH14 commented Jul 26, 2023

The smallest batch size I tested is 1024, which gives a similar performance. Since we have a learning rate scaling w.r.t. the batch size, I guess the performance will not degrade much with bsz=512, but I'm not very certain.

@cannonli7
Copy link

Will small batch size like 512 work? I only have 8 GPUs.

Hello, could you tell me how to reconstruct an image work with MAGE? I get the output image almost the same as the input image with the released checkpoint? could you help me with that?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants