
ChatGLM3-6B LoRA Fine-tuning Demo #11450

Merged
merged 6 commits into intel-analytics:main from heyang_24_6_27 on Jul 1, 2024

Conversation

@Uxito-Ada (Contributor) commented Jun 27, 2024

Description

Port the ChatGLM repo's fine-tuning demo to ipex-llm, running on Arc XPU with either:

  • a single Arc card
  • two Arc cards (requires DeepSpeed)

1. Why the change?

As described above.

2. User API changes

This will replace the old ChatGLM LoRA fine-tuning example.

3. Summary of the change

ChatGLM3-6B LoRA Fine-tuning Demo

4. How to test?

  • N/A
  • Unit test
  • Application test
  • Document test
  • ...

5. New dependencies

```
pip install "jieba>=0.42.1"
pip install "ruamel_yaml>=0.18.6"
pip install "rouge_chinese>=1.0.3"
pip install "jupyter>=1.0.0"
pip install "typer"
pip install "nltk"
```

@Uxito-Ada Uxito-Ada requested review from qiyuangong and glorysdj June 27, 2024 07:30
@Uxito-Ada (Contributor Author) commented:

The README.md is WIP; sections on 2-card fine-tuning and on serving with the fine-tuned outputs will be appended.

```
pip install "numpy<2.0.0"
# The command below installs intel_extension_for_pytorch==2.1.10+xpu by default
pip install --pre --upgrade ipex-llm[xpu] --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/
pip install oneccl_bind_pt==2.1.100 --extra-index-url https://pytorch-extension.intel.com/release-whl/stable/xpu/us/
```
Contributor:

A single Arc card doesn't need oneCCL.

@Uxito-Ada (Contributor Author) replied on Jun 27, 2024:

This is necessary: the XPU accelerator needs CCL. Without CCL, the accelerator switches to CUDA detection, and the trainer schedules the model to the CPU rather than the XPU.
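
For context, a minimal sketch of that dependency (the import names come from the oneccl_bind_pt and intel_extension_for_pytorch packages; treat the exact fallback behavior described here as an assumption, not part of this PR):

```
# Sketch: the oneCCL bindings are a side-effect import. Importing them
# registers the "ccl" distributed backend, which lets the accelerator
# detect XPU devices instead of falling back to CUDA (and then CPU).
import torch
import intel_extension_for_pytorch  # noqa: F401  (enables the torch.xpu namespace)
import oneccl_bindings_for_pytorch  # noqa: F401  (registers the "ccl" backend)

print(torch.xpu.is_available())   # expected True on an Arc machine
print(torch.xpu.device_count())   # number of visible Arc cards
```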

Contributor:

OK

@qiyuangong (Contributor) left a comment:

Otherwise, LGTM.

```
# See the License for the specific language governing permissions and
# limitations under the License.
#
# This is ported from https://github.com/THUDM/ChatGLM3/blob/main/finetune_demo/lora_finetune.ipynb
```
Contributor:

Please highlight in the code which lines need to be changed to run on Arc.

@Uxito-Ada (Contributor Author) replied:

Done, and likewise in lora_finetune_chatglm.py.

Comment on lines 414 to 426
```
from ipex_llm.transformers import AutoModelForCausalLM
from ipex_llm.transformers.qlora import get_peft_model
import os

os.environ["ACCELERATE_USE_XPU"] = "true"
model = AutoModelForCausalLM.from_pretrained(
    model_dir,
    trust_remote_code=True,
    load_in_low_bit="bf16",
    optimize_model=False,
    empty_init=False,
    use_cache=False,
    torch_dtype=torch.bfloat16
)
```
Contributor:

  1. Make it clear that L414-426 have been modified (e.g., add begin- and end-of-change comments); a hypothetical marker style is sketched after this list.
  2. Do we need to use load_in_low_bit="bf16"?
  3. Is it possible to use llm_patch to minimize the changes?
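
For illustration, one possible begin/end marker style (the marker text here is hypothetical, not taken from this PR):

```
# ===== ipex-llm change begin: schedule model onto Intel Arc XPU =====
os.environ["ACCELERATE_USE_XPU"] = "true"
# ===== ipex-llm change end =====
```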

@Uxito-Ada (Contributor Author) replied:

Switched to using llm_patch.
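
For reference, a sketch of how llm_patch is typically applied in ipex-llm fine-tuning examples (verify the exact signature against the merged code; it may differ by version):

```
# Sketch: apply ipex-llm's llm_patch before the transformers/PEFT imports,
# so the upstream HuggingFace classes are transparently patched for XPU.
from ipex_llm import llm_patch
llm_patch(train=True)

# After patching, the original upstream imports can stay unchanged:
from transformers import AutoModelForCausalLM
from peft import LoraConfig, get_peft_model
```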

```
)
if peft_config.peft_type.name == "LORA":
    # Add below L417 to enable accelerator to schedule model to Intel Arc XPU
    os.environ["ACCELERATE_USE_XPU"] = "true"
```
Contributor:

Shall we add this to llm_patch? @qiyuangong @plusbang

Contributor:

If we can use llm_patch to minimize changes, then yes.

Contributor:

Sure, maybe we could add this to llm_patch in this PR.

```
python process_advertise_gen_dataset.py
```

Then, './AdvertiseGen' will be converted to './AdvertiseGen_fix'. Now that the dataset is prepared, we are ready to start LoRA fine-tuning ChatGLM3-6B.
Contributor:

Changing ' to ` will make this highlighting work (the paths will render as code).
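
As background on the conversion step, here is a sketch of what a script like process_advertise_gen_dataset.py typically does for AdvertiseGen, based on the upstream ChatGLM3 demo (the field and file names are assumptions, not taken from this PR):

```
# Sketch: convert raw AdvertiseGen records ({"content", "summary"}) into the
# chat-style {"conversations": [...]} format the fine-tuning script expects.
import json
from pathlib import Path

def convert(src: Path, dst: Path) -> None:
    dst.parent.mkdir(parents=True, exist_ok=True)
    with open(src, encoding="utf-8") as fin, open(dst, "w", encoding="utf-8") as fout:
        for line in fin:
            item = json.loads(line)
            sample = {"conversations": [
                {"role": "user", "content": item["content"]},
                {"role": "assistant", "content": item["summary"]},
            ]}
            fout.write(json.dumps(sample, ensure_ascii=False) + "\n")

for split in ("train.json", "dev.json"):
    convert(Path("AdvertiseGen") / split, Path("AdvertiseGen_fix") / split)
```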

@Uxito-Ada Uxito-Ada merged commit 07362ff into intel-analytics:main Jul 1, 2024
26 checks passed
@Uxito-Ada Uxito-Ada deleted the heyang_24_6_27 branch July 1, 2024 01:18
RyuKosei pushed a commit to RyuKosei/ipex-llm that referenced this pull request Jul 19, 2024
* ChatGLM3-6B LoRA Fine-tuning Demo

* refine

* refine

* add 2-card deepspeed

* refine format

* add mpi4py and deepspeed install