What hardware configuration is needed for pre-training? #4

Closed
starphantom666 opened this issue Jun 2, 2023 · 4 comments
Labels
solved This problem has been already solved

Comments

@starphantom666

I'm worried that my two 4090s won't be able to handle it.

@hiyouga
Owner

hiyouga commented Jun 2, 2023

Pre-training has the same hardware requirements as SFT. With LoRA, a single 4090 is enough to train LLaMA-7B.
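For a rough sense of why LoRA makes this feasible, here is a back-of-the-envelope sketch. It assumes LLaMA-7B's published shape (hidden size 4096, 32 layers) and a hypothetical adapter setup of rank 8 on two square attention projections per layer. The rank and target modules actually used in this repository are not stated in the thread, so treat those values as placeholders.

```python
def lora_trainable_params(hidden_size, num_layers, rank, modules_per_layer):
    """Count LoRA parameters for square (hidden x hidden) projections.

    Each adapted module adds two low-rank matrices:
    A (rank x hidden) and B (hidden x rank).
    """
    per_module = rank * (hidden_size + hidden_size)
    return per_module * modules_per_layer * num_layers


def training_state_mib(trainable_params, bytes_per_value=4, copies=4):
    """Memory for the trainable part only, in MiB.

    Assumes fp32 values and four copies per parameter:
    weight, gradient, and the two Adam moment estimates.
    """
    return trainable_params * bytes_per_value * copies / 2**20


# Assumed setup: rank 8 on two projections per layer (hypothetical choice).
n = lora_trainable_params(hidden_size=4096, num_layers=32, rank=8, modules_per_layer=2)
print(f"{n:,} trainable parameters")            # 4,194,304 trainable parameters
print(f"{training_state_mib(n):.0f} MiB of adapter training state")  # 64 MiB
```

Under these assumptions the adapter weights, gradients, and optimizer state are tiny next to the frozen base model, so the memory budget is dominated by the base weights and activations, which this sketch does not count.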

@hiyouga added the pending and solved labels, then removed the pending label, Jun 2, 2023
@hiyouga hiyouga closed this as completed Jun 4, 2023
@trekrollercoaster

> Pre-training has the same hardware requirements as SFT. With LoRA, a single 4090 is enough to train LLaMA-7B.

Hello, for RLHF, would training LLaMA-7B need four 4090s?

@hiyouga
Owner

hiyouga commented Jun 10, 2023

@trekrollercoaster With QLoRA enabled, it should fit on a single card.
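As a rough illustration of why 4-bit quantization helps on a 24 GB card, the sketch below compares the memory taken by the frozen base weights alone at 16-bit and 4-bit precision. The parameter count of about 6.7 billion is an approximation, and the estimate ignores quantization constants, activations, adapter and optimizer state, and any extra models an RLHF setup keeps in memory, so it is a lower bound rather than a prediction.

```python
def weight_memory_gib(num_params, bits_per_param):
    """Memory needed to store the model weights only, in GiB."""
    return num_params * bits_per_param / 8 / 2**30


LLAMA_7B_PARAMS = 6.7e9  # approximate parameter count (assumption)

fp16 = weight_memory_gib(LLAMA_7B_PARAMS, 16)
int4 = weight_memory_gib(LLAMA_7B_PARAMS, 4)

print(f"16-bit weights: {fp16:.2f} GiB")  # 16-bit weights: 12.48 GiB
print(f" 4-bit weights: {int4:.2f} GiB")  #  4-bit weights: 3.12 GiB
```

At 4 bits the base weights take about a quarter of the 16-bit footprint, which leaves considerably more of the card free for activations and the other training state.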

@trekrollercoaster

> @trekrollercoaster With QLoRA enabled, it should fit on a single card.

OK, thanks.

@bisque-qwe bisque-qwe mentioned this issue Dec 6, 2024
1 task