chat.load(device="cuda", use_flash_attn=True): setting compile=True causes a hang #797

Open
yourengod opened this issue Oct 23, 2024 · 2 comments
Labels: algorithm (Algorithm improvements & issues), documentation (Improvements or additions to documentation), performance (Running speed & quality)

Comments

@yourengod

Setting compile=True makes it hang. What is the cause of this?

@medemi68

I don't speak Chinese, but I can tell you that I encountered the same issue. It doesn't actually hang; it just takes 10-15 minutes to compile on the first generation on a 4090. I'll also report that the improvement in speed is very minor. For me, it took 9 seconds to generate an 8-second audio clip. After you set compile=True, you'll likely have to run a warmup step for the actual compilation to happen, and that is what takes so long. It may seem like it's hanging, but it is not. In all honesty, though, the performance improvement was very minor: going from 38 it/s to 45 it/s.
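To check whether a slow first call is a one-time compile cost rather than a real hang, you can time each call separately. Below is a minimal sketch of that idea. `time_calls` and `make_fake_generate` are hypothetical helpers written for this illustration, and the fake generator only simulates a one-time delay; it is not ChatTTS code.

```python
import time
from typing import Any, Callable, List, Sequence


def time_calls(generate: Callable[[Any], Any], inputs: Sequence[Any]) -> List[float]:
    """Call generate(x) once per input and return the wall-clock seconds of each call."""
    durations = []
    for x in inputs:
        start = time.perf_counter()
        generate(x)
        durations.append(time.perf_counter() - start)
    return durations


def make_fake_generate(compile_seconds: float = 0.2) -> Callable[[str], str]:
    """Hypothetical stand-in for a compiled model: only the first call is slow."""
    state = {"compiled": False}

    def generate(text: str) -> str:
        if not state["compiled"]:
            time.sleep(compile_seconds)  # simulates the one-time compilation delay
            state["compiled"] = True
        return text.upper()

    return generate


if __name__ == "__main__":
    durations = time_calls(make_fake_generate(), ["warmup", "real input"])
    print(f"first call: {durations[0]:.3f}s, second call: {durations[1]:.3f}s")
```

With the real model you would pass your own generation call instead of the fake one, for example something like `lambda t: chat.infer([t])`, assuming `infer` accepts a list of texts in your ChatTTS version. If the first call eventually returns and later calls are much faster, the delay was compilation and not a hang.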

@yourengod
Author


Thank you, I'll try again.

@fumiama added the documentation, algorithm, and performance labels on Oct 30, 2024