Fail to load weight from pair-preference-model-LLaMA3-8B #4

matouk98 · 2024-05-31T08:15:45Z

Hi, congratulations on the great work, and thanks for open-sourcing it!

I am running step 3.2 with pair-preference-model-LLaMA3-8B. However, I encountered the warning "Some weights of LlamaForSequenceClassification were not initialized from the model checkpoint at RLHFlow/pair-preference-model-LLaMA3-8B and are newly initialized: ['score.weight']". Could you please help me with the issue? Thanks a lot!

WeiXiongUST · 2024-05-31T15:06:37Z

The current code is for the Bradley-Terry reward model, which is loaded as an ``AutoModelForSequenceClassification''.

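In contrast, the pair-preference model is an ``AutoModelForCausalLM'', so its checkpoint has no classification head. That is why ``score.weight'' is reported as newly initialized. The two models are also used differently. I plan to write a separate script for the pair-RM in the next few days.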
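To illustrate how the usage of a causal-LM preference model can differ from a scalar reward head, here is a minimal sketch. It assumes (this is not confirmed in this thread) that the model is prompted with both responses and that the preference is read from the next-token logits of two answer tokens such as "A" and "B". The function name `preference_prob` and the `temperature` parameter are hypothetical, and the logits are passed in as plain floats rather than computed by a real model.

```python
import math


def preference_prob(logit_a: float, logit_b: float, temperature: float = 1.0) -> float:
    """Return the probability that response A is preferred over response B.

    Assumption: logit_a and logit_b are the next-token logits for the
    answer tokens "A" and "B" produced by a causal LM. This is a two-way
    softmax over those logits, not the repository's actual script.
    """
    za = logit_a / temperature
    zb = logit_b / temperature
    m = max(za, zb)  # subtract the max for numerical stability
    ea = math.exp(za - m)
    eb = math.exp(zb - m)
    return ea / (ea + eb)
```

A Bradley-Terry reward model, by contrast, would return one scalar per response from its classification head, so no second response is needed at inference time.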

Thanks for bringing this issue to our attention.

hmzo · 2024-06-27T03:18:06Z

@WeiXiongUST Hello, has there been any recent progress on this? I'm curious whether the pair-RM needs $C_k^2$ inferences for k candidates. How can we get an absolute reward score for each candidate?
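For reference, the count in question and one possible way to turn pairwise preferences into per-candidate scores can be sketched as follows. This is only an illustration of a generic average-win-probability aggregation, not the maintainers' answer or the repository's method. The helper names `num_comparisons` and `pairwise_scores` are hypothetical, and `prob_first_wins` stands in for a call to the pair-RM.

```python
from itertools import combinations
from math import comb
from typing import Callable, List


def num_comparisons(k: int) -> int:
    """Number of unordered pairs among k candidates, i.e. C(k, 2)."""
    return comb(k, 2)


def pairwise_scores(k: int, prob_first_wins: Callable[[int, int], float]) -> List[float]:
    """Average win probability of each candidate against all others.

    Assumption: prob_first_wins(i, j) returns P(candidate i beats candidate j)
    and each unordered pair is evaluated once.
    """
    if k < 2:
        raise ValueError("need at least two candidates")
    totals = [0.0] * k
    for i, j in combinations(range(k), 2):
        p = prob_first_wins(i, j)
        totals[i] += p        # credit for candidate i winning
        totals[j] += 1.0 - p  # complementary credit for candidate j
    return [t / (k - 1) for t in totals]
```

The resulting scores are relative to the candidate set rather than absolute rewards, and evaluating each pair only once ignores any position bias in the preference model.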
