chat.load(device="cuda", use_flash_attn=True): # Setting compile=True causes a hang. #797
Labels
algorithm: Algorithm improvements & issues
documentation: Improvements or additions to documentation
performance: Running speed & quality
Setting compile=True causes the program to hang. What is the reason for this?