Add ipex-llm npu option in setup.py #11858

sgwhat · 2024-08-20T05:38:09Z

Description

1. Why the change?

This PR provides pip install ipex-llm[npu] support for windows device.

2. User API changes

Previous version to run multi-processes npu example:

pip install ipex-llm[all]
pip install bigdl-core-npu
pip install transformers==4.40.0

Current version:

pip install ipex-llm[npu]

3. Summary of the change

4. How to test?

Local test

Oscilloscope98 · 2024-08-20T05:49:55Z

python/llm/setup.py

+    for exclude_require in cpu_transformers_version:
+        npu_requires.remove(exclude_require)
+    npu_requires += ["transformers==4.40.0",
+                     "bigdl-core-npu==" + CORE_XE_VERSION]


Should we add ;platform_system=='Windows'" ?

Do we need to raise an error when Linux users try to install ipex-llm[npu]? @jason-dai

Oscilloscope98

others LGTM

* feat: update readme for ppl test * fix: textual adjustments * fix: textual adjustments * Add ipex-llm npu option in setup.py (#11858) * add ipex-llm npu release * update example doc * meet latest release changes * optimize phi3 memory usage (#11867) * Update `ipex-llm` default transformers version to 4.37.0 (#11859) * Update default transformers version to 4.37.0 * Add dependency requirements for qwen and qwen-vl * Temp fix transformers version for these not yet verified models * Skip qwen test in UT for now as it requires transformers<4.37.0 * Update performance test regarding updated default `transformers==4.37.0` (#11869) * Update igpu performance from transformers 4.36.2 to 4.37.0 (#11841) * upgrade arc perf test to transformers 4.37 (#11842) * fix load low bit com dtype (#11832) * feat: add mixed_precision argument on ppl longbench evaluation * fix: delete extra code * feat: upgrade arc perf test to transformers 4.37 * fix: add missing codes * fix: keep perf test for qwen-vl-chat in transformers 4.36 * fix: remove extra space * fix: resolve pr comment * fix: add empty line * fix: add pip install for spr and core test * fix: delete extra comments * fix: remove python -m for pip * Revert "fix load low bit com dtype (#11832)" This reverts commit 6841a9a. --------- Co-authored-by: Zhao Changmin <[email protected]> Co-authored-by: Jinhe Tang <[email protected]> * add transformers==4.36 for qwen vl in igpu-perf (#11846) * add transformers==4.36.2 for qwen-vl * Small update --------- Co-authored-by: Yuwen Hu <[email protected]> * fix: remove qwen-7b on core test (#11851) * fix: remove qwen-7b on core test * fix: change delete to comment --------- Co-authored-by: Jinhe Tang <[email protected]> * replce filename (#11854) * fix: remove qwen-7b on core test * fix: change delete to comment * fix: replace filename --------- Co-authored-by: Jinhe Tang <[email protected]> * fix: delete extra comments (#11863) * Remove transformers installation for temp test purposes * Small fix * Small update --------- Co-authored-by: Chu,Youcheng <[email protected]> Co-authored-by: Zhao Changmin <[email protected]> Co-authored-by: Jinhe Tang <[email protected]> Co-authored-by: Zijie Li <[email protected]> Co-authored-by: Chu,Youcheng <[email protected]> * Pytorch models transformers version update (#11860) * yi sync * delete 4.34 constraint * delete 4.34 constraint * delete 4.31 constraint * delete 4.34 constraint * delete 4.35 constraint * added <=4.33.3 constraint * added <=4.33.3 constraint * switched to chinese prompt * Update compresskv model forward type logic (#11868) * update * fix * Update local import for ppl (#11866) Co-authored-by: jenniew <[email protected]> * fix: textual adjustment --------- Co-authored-by: SONG Ge <[email protected]> Co-authored-by: Yishuo Wang <[email protected]> Co-authored-by: Yuwen Hu <[email protected]> Co-authored-by: Zhao Changmin <[email protected]> Co-authored-by: Jinhe Tang <[email protected]> Co-authored-by: Zijie Li <[email protected]> Co-authored-by: Yina Chen <[email protected]> Co-authored-by: RyuKosei <[email protected]> Co-authored-by: jenniew <[email protected]>

sgwhat added 2 commits August 20, 2024 13:33

add ipex-llm npu release

17e7be0

update example doc

c71c158

sgwhat requested a review from Oscilloscope98 August 20, 2024 05:43

Oscilloscope98 reviewed Aug 20, 2024

View reviewed changes

sgwhat added 2 commits August 20, 2024 14:08

meet latest release changes

9b0b0e0

update

0a7dc9b

Oscilloscope98 approved these changes Aug 20, 2024

View reviewed changes

sgwhat added 2 commits August 20, 2024 17:28

revert to merge

1dc6053

merge

5158fb3

sgwhat merged commit 5b83493 into intel-analytics:main Aug 20, 2024
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add ipex-llm npu option in setup.py #11858

Add ipex-llm npu option in setup.py #11858

sgwhat commented Aug 20, 2024 •

edited

Loading

Oscilloscope98 Aug 20, 2024

sgwhat Aug 20, 2024 •

edited

Loading

Oscilloscope98 left a comment

Add ipex-llm npu option in setup.py #11858

Add ipex-llm npu option in setup.py #11858

Conversation

sgwhat commented Aug 20, 2024 • edited Loading

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

Oscilloscope98 Aug 20, 2024

Choose a reason for hiding this comment

sgwhat Aug 20, 2024 • edited Loading

Choose a reason for hiding this comment

Oscilloscope98 left a comment

Choose a reason for hiding this comment

sgwhat commented Aug 20, 2024 •

edited

Loading

sgwhat Aug 20, 2024 •

edited

Loading