Quick start: Install bigdl-llm on windows gpu #10195
Conversation
Sure, I will rename and move the figs.
The two Visual Studio figures (i.e., fig1 and fig2) are too large; rescale them and put them side by side.
I don't think we need fig3. Add a figure showing how to use Windows Task Manager to check iGPU/GPU status, etc.
Step 1: Run the commands below in Anaconda Prompt.

```bash
conda create -n llm python=3.9 libuv # Already done in "Install conda" section
```
You already created the llm env earlier, so just remove this line to avoid confusion.
```bash
conda create -n llm python=3.9 libuv # Already done in "Install conda" section
conda activate llm
pip install dpcpp-cpp-rt==2024.0.2 mkl-dpcpp==2024.0.0 onednn==2024.0.0 # Already done in "Install oneAPI" section
```
Remove the oneAPI install line, as it is already done in the previous section.
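Taken together, the two comments above reduce this step to just activating the existing env. A sketch of the suggested revision, assuming the "Install conda" and "Install oneAPI" sections were completed earlier:

```bash
conda activate llm
```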
```python
from transformers import AutoTokenizer, GenerationConfig
```
Then we use phi-1.5 as an example to show how to run the model with bigdl-llm on Windows.
Move the phi-1.5 example into a new section, "A Quick Example".
```python
generation_config = GenerationConfig(use_cache=True)

if __name__ == '__main__':
    parser = argparse.ArgumentParser(description='Predict Tokens using `generate()` API for phi-1_5 model')
```
Make this example as simple as possible, without much code (a simplified sketch follows below):
- remove the argparse section (hard-code the argument values in the code instead)
- remove the timing code
- make the comments concise
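Applying these comments, the simplified example might look roughly like the sketch below. This is a minimal sketch, not the PR's actual code: the model id `microsoft/phi-1_5`, the prompt, and `max_new_tokens` are illustrative placeholders, and it assumes the bigdl-llm XPU install from the steps above. Saved as `demo.py`, it would run with `python demo.py`.

```python
import torch
from bigdl.llm.transformers import AutoModelForCausalLM
from transformers import AutoTokenizer, GenerationConfig

model_path = "microsoft/phi-1_5"   # placeholder: local path or HF model id
prompt = "What is AI?"             # placeholder prompt

# Load the model in 4-bit and move it to the Intel GPU.
model = AutoModelForCausalLM.from_pretrained(model_path,
                                             load_in_4bit=True,
                                             trust_remote_code=True).to('xpu')
tokenizer = AutoTokenizer.from_pretrained(model_path, trust_remote_code=True)
generation_config = GenerationConfig(use_cache=True)

# Generate a short completion and print it.
with torch.inference_mode():
    input_ids = tokenizer.encode(prompt, return_tensors="pt").to('xpu')
    output = model.generate(input_ids,
                            generation_config=generation_config,
                            max_new_tokens=32)
output_str = tokenizer.decode(output[0], skip_special_tokens=True)

print('-'*20, 'Output', '-'*20)
print(output_str)
```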
* Run the commands below in Anaconda Prompt. Note that the transformers version should match the model you want to use; for example, here we use transformers 4.37.0 to run the demo.

```bash
conda activate llm

pip install --pre --upgrade bigdl-llm[xpu] -f https://developer.intel.com/ipex-whl-stable-xpu
pip install transformers==4.37.0
```
Move the transformers-related info to the example section.
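Since the demo is pinned to transformers 4.37.0, the example section could open with a quick version check. A minimal sketch (this exact check is a suggestion, not part of the PR):

```python
# Confirm the pinned transformers version before running the phi-1.5 demo.
import transformers

assert transformers.__version__ == "4.37.0", transformers.__version__
```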
* Now we can test whether all the components have been installed correctly. If all the packages in the Python file below import without error, the installation is correct.

```python
import torch
import time
import argparse
import numpy as np

from bigdl.llm.transformers import AutoModel, AutoModelForCausalLM
from transformers import AutoTokenizer, GenerationConfig
```
How does the user run this? Maybe it's easiest to run it in the Python prompt?
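One low-friction way to run the check (a suggestion, not something the PR specifies): paste the imports into an interactive Python session inside the activated `llm` env, for example:

```python
>>> import torch
>>> from bigdl.llm.transformers import AutoModel, AutoModelForCausalLM
>>> from transformers import AutoTokenizer, GenerationConfig
>>> # no ImportError here means the components are installed correctly
```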
```python
print('-'*20, 'Output', '-'*20)
print(output_str)
```

Here is the sample output on a laptop equipped with an 11th Gen Intel(R) Core(TM) i7-1185G7 and Intel(R) Iris(R) Xe Graphics after running the example program above.
How does the user run this example?
We provide the contents of `demo.py`, and users could run it as `python demo.py`.
Clean up the unused files.
Quick start: Install bigdl-llm on windows gpu