-
Notifications
You must be signed in to change notification settings - Fork 221
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
how to convert huggingface model to megatron-deepspeed? #329
Comments
This is not possible. |
why? So I have to training from the ground up? |
I don't understand the issue. Do you just need to run inference? |
Hello, I have the same problem. I want to load the model on Huggingface as a pre-training model weight and continue the training using the Megatron Deepspeed framework. But I found that I didn't know how to convert the weight of Huggingface into the weight of Megatron Deepspeed. I look forward to your help. Thank you. |
By the way: model structure: gpt I want to train the model with 4 pipeline parallel and deepspeed. |
@AnShengqiang Its non-trivial to convert models for training. |
Thank you for your reply, I will go to find the answer, if there is good news, I will put it here. |
Same problem. Any tools can do this? |
as title said.
The text was updated successfully, but these errors were encountered: