Missing key(s) in state_dict when testing using predict_downstream_condition.py #17
Hi, did you train the model with DDP? If so, the state dict keys may differ.
I met this problem too.
The keys in sd2 look like module.roberta.... while sd1's look like roberta...., so they don't match; it runs after I change them. I'd also like to know how long you trained and on what hardware. I find the model often stalls at each eval_step to save a checkpoint, which takes up most of the training time.
Same problem.
Thank you for your great work. I have the same problem. As explained in this GitHub repo, I executed run.sh, which ran DDP_main.py. However, I'm confused because @Hzfinfdu said the state dict keys of a DDP-trained model may differ. Thank you.
I found that this was because the keys in the checkpoint saved after training carry an extra prefix, so removing it works. Change model.load_state_dict(ckpt['model']) to:

ckpt_model = ckpt['model']
new_ckpt = {}
for key, value in ckpt_model.items():
    new_ckpt[key[7:]] = value  # drop the leading 'module.' (7 characters)
model.load_state_dict(new_ckpt)

I don't know if this is correct, but the program runs and outputs the results correctly.
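A slightly safer variant of the fix above, as a self-contained sketch (the helper name strip_ddp_prefix is mine, not from the repo): it only strips keys that actually start with "module.", so it also works unchanged on checkpoints that were saved without DDP.

```python
def strip_ddp_prefix(state_dict, prefix="module."):
    """Return a copy of state_dict with the DDP wrapper prefix removed.

    DistributedDataParallel (and DataParallel) hold the model in a
    `.module` attribute, so every key in the saved state dict gains a
    'module.' prefix; the bare model expects keys without it.
    """
    return {
        (key[len(prefix):] if key.startswith(prefix) else key): value
        for key, value in state_dict.items()
    }
```

Then loading becomes model.load_state_dict(strip_ddp_prefix(ckpt['model'])).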
Thank you for your reply. I solved the issue with your suggestion. Could you share the modified code? It would be really helpful for me!
python predict_downstream_condition.py --ckpt_path model_name_roberta-base_taskname_qqp_lr_3e-05_seed_42_numsteps_2000_sample_Categorical_schedule_mutual_hybridlambda_0.0003_wordfreqlambda_0.0_fromscratch_False_timestep_none_ckpts/best(38899).th
using standard schedule with num_steps: 2000.
Traceback (most recent call last):
File "predict_downstream_condition.py", line 101, in
model.load_state_dict(ckpt['model'])
File "/opt/conda/envs/diff/lib/python3.7/site-packages/torch/nn/modules/module.py", line 1672, in load_state_dict
self.__class__.__name__, "\n\t".join(error_msgs)))
RuntimeError: Error(s) in loading state_dict for RobertaForMaskedLM:
Missing key(s) in state_dict: "roberta.embeddings.position_ids", "roberta.embeddings.word_embeddings.weight", "roberta.embeddings.position_embeddings.weight", "roberta.embeddings.token_type_embeddings.weight", "roberta.embeddings.LayerNorm.weight", "roberta.embeddings.LayerNorm.bias", "roberta.encoder.layer.0.attention.self.query.weight", "roberta.encoder.layer.0.attention.self.query.bias", ...
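For anyone hitting this error, the mismatch is easy to confirm before calling load_state_dict by comparing the keys the model expects against the keys in the checkpoint. A minimal pure-Python sketch (the helper name compare_keys is hypothetical, not part of the repo or of PyTorch):

```python
def compare_keys(model_keys, ckpt_keys):
    """Report keys the model expects but the checkpoint lacks, and vice versa.

    'missing' corresponds to the "Missing key(s)" part of the RuntimeError,
    'unexpected' to the "Unexpected key(s)" part.
    """
    model_keys, ckpt_keys = set(model_keys), set(ckpt_keys)
    missing = sorted(model_keys - ckpt_keys)
    unexpected = sorted(ckpt_keys - model_keys)
    return missing, unexpected
```

In this issue, every roberta.* key shows up as missing while the matching module.roberta.* key shows up as unexpected, which points directly at the DDP "module." prefix.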