attack success rate:0.0 #3
Comments
I ran into the same problem. I added the attack_percent parameter in data_posion.py (lines 236 and 237) so that poisoned samples are generated in the correct proportion. I used the mind2web task and trained on an NVIDIA GeForce RTX 3090 GPU. When I evaluated with the command from README.md without merging the model, the ASR was around 15%. I thought this might be because agentlm-7b had not been merged with the QLoRA layers, so I added "--need_merge_model" to the command. With that flag the ASR is always 0.
Is something wrong in my data poisoning, training, or evaluation process? Are there any command-line parameters or code that need to be modified?
Here is the terminal output when I merged the model and tested it:
INFO | pipeline.merge:merge_module:21 - THUDM/agentlm-7b merge output/m2w_qlora module has been saved to output/m2w_model
Could this warning indicate the problem? "Some weights of LlamaForCausalLM were not initialized from the model checkpoint at output/m2w_model and are newly initialized: ['lm_head.weight']"
Looking forward to your help, thank you.
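One way to follow up on the warning quoted above is to check whether the merged checkpoint actually contains `lm_head.weight`. The following is only a minimal sketch, assuming the merged model in output/m2w_model was saved as a sharded checkpoint with an index JSON file holding a `weight_map` (the layout Hugging Face Transformers uses for sharded saves). The helper name `find_missing_weights` is hypothetical and not part of this repository.

```python
import json


def find_missing_weights(index_file, expected_names):
    """Return the expected parameter names that the checkpoint index does not list.

    index_file: path to a sharded-checkpoint index JSON that contains a
    "weight_map" dict mapping parameter names to shard file names.
    """
    with open(index_file, encoding="utf-8") as f:
        weight_map = json.load(f)["weight_map"]
    # Keep only the names that have no entry in the index.
    return [name for name in expected_names if name not in weight_map]
```

For example, `find_missing_weights("output/m2w_model/model.safetensors.index.json", ["lm_head.weight"])` would return `["lm_head.weight"]` if the merge step did not write the output head; the index file name here is an assumption and depends on how the model was saved. A missing output head would be randomly initialized at load time, which could plausibly explain an ASR of 0.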
Same problem.
I followed README.md to poison the data and train (with QLoRA), then merged and evaluated, but the attack success rate is always 0.
All other parameter settings are the defaults in main.
PS: the attack_percent parameter seems to be missing in data_posion.py (line 236) in the pipeline:
train_data = self.get_backdoor_data_dict(train_data)
val_data = self.get_backdoor_data_dict(val_data)
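For illustration, poisoning only a fraction of the samples could look roughly like the sketch below. This is not the repository's code: `poison_fraction` and `add_trigger` are hypothetical names, and the real trigger logic in `get_backdoor_data_dict` may work differently. It only shows how an `attack_percent` value in [0, 1] can control how many samples get poisoned.

```python
import random


def poison_fraction(samples, attack_percent, poison_fn, seed=0):
    """Apply poison_fn to a random attack_percent share of samples.

    samples: list of training examples (any type).
    attack_percent: fraction in [0, 1] of examples to poison.
    poison_fn: callable that returns a poisoned copy of one example.
    """
    if not 0.0 <= attack_percent <= 1.0:
        raise ValueError("attack_percent must be between 0 and 1")
    n_poison = round(len(samples) * attack_percent)
    rng = random.Random(seed)  # fixed seed keeps the split reproducible
    chosen = set(rng.sample(range(len(samples)), n_poison))
    return [poison_fn(s) if i in chosen else s for i, s in enumerate(samples)]


def add_trigger(sample):
    # Hypothetical stand-in for the real backdoor trigger insertion.
    return {**sample, "poisoned": True}
```

With 10 samples and `attack_percent=0.3`, three samples are passed through `add_trigger` and the other seven are returned unchanged.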