QLoRA 训练报错 #13

Zarc98 · 2023-06-06T08:48:06Z

int4 报错：RuntimeError: self and mat2 must have the same dtype
训练参数：
CUDA_VISIBLE_DEVICES=0 python src/train_sft.py
--model_name_or_path /models/bloomz-7b1-mt
--do_train
--dataset alpaca_gpt4_zh
--finetuning_type lora
--quantization_bit 4
--output_dir bloomz_lora
--overwrite_cache
--per_device_train_batch_size 1
--gradient_accumulation_steps 4
--lr_scheduler_type cosine
--logging_steps 10
--save_steps 1000
--learning_rate 5e-5
--num_train_epochs 3.0
--resume_lora_training False
--plot_loss
--fp16

上述参数的 --quantization_bit 如果设置为 8  可正常训练
设备：RTX3080

The text was updated successfully, but these errors were encountered:

suclogger · 2023-06-06T14:51:33Z

seems relate to #15

hiyouga · 2023-06-07T04:42:40Z

请更新 peft 库版本。

pip install -U git+https://github.com/huggingface/peft.git

Zarc98 · 2023-06-07T06:12:07Z

更新到最新peft库版本可以正常开启int4 qlora训练

hiyouga added the pending This problem is yet to be addressed label Jun 7, 2023

hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Jun 7, 2023

hiyouga closed this as completed Jun 7, 2023

godfly mentioned this issue Aug 17, 2023

大数据量全参数预训练报错、流式读数据报错 #549

Closed

YananSunn mentioned this issue Aug 31, 2023

单节点多卡A100 全量微调 CUDA error: an illegal memory access was encountered #267

Closed

liwenju0 mentioned this issue Sep 18, 2023

when running tokenizer on datasets，program crashed #954

Closed

Mr-Otaku-Lin mentioned this issue Jun 13, 2024

Qwen2-7B lora训练后推理出错 #4251

Closed

1 task

zhoushaoxiang mentioned this issue Jun 14, 2024

Ascend-D910 训练 RuntimeError: SET StreamOverflowSwitch Failed. #4284

Closed

1 task

ldknight mentioned this issue Jul 2, 2024

glm4在stage==rm微调时评估出现：CUDA error: device-side assert triggered #4646

Closed

1 task

hiennguyennq mentioned this issue Oct 21, 2024

distributed training: using GPU 0 to perform barrier as devices used by this process are currently unknown. #5769

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

QLoRA 训练报错 #13

QLoRA 训练报错 #13

Zarc98 commented Jun 6, 2023

suclogger commented Jun 6, 2023

hiyouga commented Jun 7, 2023

Zarc98 commented Jun 7, 2023

QLoRA 训练报错 #13

QLoRA 训练报错 #13

Comments

Zarc98 commented Jun 6, 2023

suclogger commented Jun 6, 2023

hiyouga commented Jun 7, 2023

Zarc98 commented Jun 7, 2023