train_sft.py中加载tokenizer耗时太长，请问是正常的吗？ #3

70557dzqc · 2023-06-02T06:19:57Z

06/02/2023 06:16:54 - INFO - utils.common - Loaded tokenizer in 569.9713776111603 seconds. 接近10分钟了。

thaumstrial · 2023-06-02T11:00:37Z

@panxb833 如果你的数据集很多而且cpu性能较差的话是正常的

hiyouga · 2023-06-04T05:22:57Z

新版代码中修改了加载逻辑，应该变快了。

hiyouga added the pending This problem is yet to be addressed label Jun 2, 2023

hiyouga added solved This problem has been already solved and removed pending This problem is yet to be addressed labels Jun 4, 2023

hiyouga closed this as completed Jun 4, 2023

DBtxy mentioned this issue Jul 27, 2023

单节点多卡A100 全量微调 CUDA error: an illegal memory access was encountered #267

Closed

godfly mentioned this issue Aug 17, 2023

大数据量全参数预训练报错、流式读数据报错 #549

Closed

HaimianYu mentioned this issue Nov 24, 2023

deepspeed多机多卡，训练以第一个batch卡住，然后报错Socket Timeout #1630

Closed

1 task

alukanlp mentioned this issue Feb 25, 2024

使用internlm2模型+deepspeed多机多卡训练报错 #2585

Closed

1 task

feria-tu mentioned this issue May 17, 2024

在昇腾npu环境下运行报错 #3779

Closed

1 task

zjxxsr mentioned this issue Jun 11, 2024

使用单机多卡微调Qwen2-72B #4205

Closed

1 task

Mr-Otaku-Lin mentioned this issue Jun 13, 2024

Qwen2-7B lora训练后推理出错 #4251

Closed

1 task

zhoushaoxiang mentioned this issue Jun 14, 2024

Ascend-D910 训练 RuntimeError: SET StreamOverflowSwitch Failed. #4284

Closed

1 task

ldknight mentioned this issue Jul 2, 2024

glm4在stage==rm微调时评估出现：CUDA error: device-side assert triggered #4646

Closed

1 task

xiao-liya mentioned this issue Jul 14, 2024

微调模型时数据集加载报错 #4814

Closed

1 task

fuqiang-benz mentioned this issue Jul 15, 2024

昇腾910b推理baichuan2-13B模型报错：The operator 'aten::isin.Tensor_Tensor_out' is not currently supported on the NPU backend and will 待解决 #4836

Closed

1 task

hecheng64 mentioned this issue Sep 16, 2024

多机多卡运行报错 #5450

Closed

1 task

Hansen06 mentioned this issue Sep 27, 2024

多机多卡训练想问下现在是不支持accelerate launch训练吗 #5558

Closed

1 task

alphanlp mentioned this issue Oct 5, 2024

Meet c10::DistBackendError when finetuning Qwen2-VL with video dataset #5417

Closed

1 task

hiennguyennq mentioned this issue Oct 21, 2024

distributed training: using GPU 0 to perform barrier as devices used by this process are currently unknown. #5769

Open

bisque-qwe mentioned this issue Dec 6, 2024

WorkNCCL #6269

Closed

1 task

asdksadsad mentioned this issue Dec 19, 2024

昇腾NPU使用API推理报错 #3796

Closed

1 task

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

train_sft.py中加载tokenizer耗时太长，请问是正常的吗？ #3

train_sft.py中加载tokenizer耗时太长，请问是正常的吗？ #3

70557dzqc commented Jun 2, 2023

thaumstrial commented Jun 2, 2023

hiyouga commented Jun 4, 2023 •

edited

Loading

train_sft.py中加载tokenizer耗时太长，请问是正常的吗？ #3

train_sft.py中加载tokenizer耗时太长，请问是正常的吗？ #3

Comments

70557dzqc commented Jun 2, 2023

thaumstrial commented Jun 2, 2023

hiyouga commented Jun 4, 2023 • edited Loading

hiyouga commented Jun 4, 2023 •

edited

Loading