Bradley-Terry model removes lm head while saving #22

Open
Arnav0400 opened this issue Jul 11, 2024 · 1 comment

@Arnav0400

Hello and thanks for your work!

When I run bradley-terry-rm/llama3_rm.py, the final saved model does not have an LM head, because the script uses an AutoModelForSequenceClassification model rather than a CausalLM one. As a result, the LM head is initialized from scratch when the saved model is loaded. Is this expected, or do you manually add the LM head weights?
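
For illustration, here is a minimal sketch of how one might check which head a saved checkpoint contains. It assumes the Llama-style parameter names used by transformers (`lm_head.weight` for the causal-LM head, `score.weight` for the sequence-classification head). The helper `find_heads` and the shortened key lists are hypothetical and were not taken from a real checkpoint.

```python
# Hypothetical helper: report which head parameters appear in a checkpoint.
# Assumption: Llama-style names from transformers, i.e. "lm_head.*" for the
# causal-LM head and "score.*" for the sequence-classification head.
def find_heads(param_names):
    names = list(param_names)
    return {
        "lm_head": any(n.startswith("lm_head.") for n in names),
        "score": any(n.startswith("score.") for n in names),
    }


# Illustrative, shortened key lists (not dumped from a real checkpoint).
reward_model_keys = ["model.embed_tokens.weight", "model.norm.weight", "score.weight"]
causal_lm_keys = ["model.embed_tokens.weight", "model.norm.weight", "lm_head.weight"]

rm_heads = find_heads(reward_model_keys)
lm_heads = find_heads(causal_lm_keys)
print(rm_heads)
print(lm_heads)
```

In practice the names could come from `model.state_dict().keys()` on the loaded model. A checkpoint that only contains `score.*` has no stored LM head weights, so loading it as a causal LM would have to create that layer fresh.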

@WeiXiongUST
Collaborator

You could test the saved model with useful_code/eval_reward_bench_bt.py to check whether it was saved correctly.
