[Question] Converting a Megatron-LM ckpt to Nemo #10831

Open
abgoswam opened this issue Oct 10, 2024 · 2 comments

Comments

@abgoswam

Following the GPT Pretraining section in the Megatron-LM repo, we were able to successfully train a model using Megatron-LM.

I saw pointers on how to convert from HF to NeMo, for example the conversion script convert_llama_hf_to_nemo.py.

However, I did not see any examples of converting a ckpt saved with Megatron-LM to the NeMo format. Are there any examples for this?

P.S. I am thinking of doing this conversion to NeMo so that I can use tools like NeMo-Aligner for post-training.

@aimarz

aimarz commented Oct 10, 2024

@abgoswam
Author

Thanks, @aimarz. Let me try it out on the Megatron-LM ckpts we pre-trained.
