[NPU] Support Baichuan groupwise & gw code refactor #12337
Conversation
Please carefully verify the models; otherwise LGTM.
```diff
  mlp_module_names = ["down_proj", "up_proj", "gate_proj"]
  if (
      isinstance(module, (Qwen2Attention, LlamaAttention))
-     or module.__class__.__name__ in ['MiniCPMAttention']
+     or module.__class__.__name__ in ['MiniCPMAttention', 'Attention']
```
Shall we also update the following check of `module.__class__.__name__ in ['MiniCPMMLP', 'MLP']`?
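
For illustration, a minimal sketch of the dispatch this comment points at, with the MLP branch extended the same way as the attention branch. The attention class names come from the diff above; the `Qwen2MLP`/`LlamaMLP` names and the projection-name lists are assumptions, and class-name matching stands in for the `isinstance` checks so the snippet runs without importing transformers:

```python
ATTN_PROJ_NAMES = ["q_proj", "k_proj", "v_proj", "o_proj"]  # assumed list
MLP_PROJ_NAMES = ["down_proj", "up_proj", "gate_proj"]      # from the diff

def proj_names_for(module):
    """Return the linear-layer names to process for a given module."""
    name = module.__class__.__name__
    # Attention branch from the diff: Baichuan's attention class is
    # presumably plain 'Attention', hence the extra name next to
    # 'MiniCPMAttention'.
    if name in ("Qwen2Attention", "LlamaAttention",
                "MiniCPMAttention", "Attention"):
        return ATTN_PROJ_NAMES
    # The suggestion above: extend the MLP check the same way, so that a
    # plain 'MLP' class is covered next to 'MiniCPMMLP'.
    if name in ("Qwen2MLP", "LlamaMLP", "MiniCPMMLP", "MLP"):
        return MLP_PROJ_NAMES
    return []
```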
```diff
@@ -115,8 +124,10 @@ def __init__(
     attention_mask = self.create_input_op((self.batch_size, 1, 1, self.max_seq_len + 1),
                                           dtype=np.int64)
 else:
     attention_mask = self.create_input_op((self.batch_size, 1, self.seq_len, self.seq_len),
                                           dtype=np.int64)
+    # attention_mask = self.create_input_op((self.batch_size, 1, self.seq_len,
```
Please remove this commented-out line directly.
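
For context, a minimal numpy sketch of the two mask shapes this hunk distinguishes: decode keeps a single query row over the whole KV window, while prefill uses a full causal mask over the prompt. The additive-mask convention and the `MIN_VALUE` fill are assumptions here; the real graph declares these tensors as inputs via `create_input_op` rather than building them eagerly:

```python
import numpy as np

MIN_VALUE = np.iinfo(np.int64).min  # assumed masked-fill value

def make_attention_mask(batch_size, seq_len, max_seq_len, decoding):
    if decoding:
        # Decode: one query token attends to the whole KV cache plus itself,
        # matching the (B, 1, 1, max_seq_len + 1) shape above.
        return np.zeros((batch_size, 1, 1, max_seq_len + 1), dtype=np.int64)
    # Prefill: standard causal mask over the prompt, matching the
    # (B, 1, seq_len, seq_len) shape above.
    mask = np.full((batch_size, 1, seq_len, seq_len), MIN_VALUE,
                   dtype=np.int64)
    return np.triu(mask, k=1)  # zeros on/below the diagonal, masked above

# Shapes match the create_input_op calls in the hunk.
assert make_attention_mask(1, 8, 1024, decoding=True).shape == (1, 1, 1, 1025)
assert make_attention_mask(1, 8, 1024, decoding=False).shape == (1, 1, 8, 8)
```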
Description
TODOs:
1. Why the change?
2. User API changes
3. Summary of the change
4. How to test?
   - Unit test: please manually trigger the PR validation by inputting the PR number (e.g., 1234). And paste your action link here once it has been successfully finished.
5. New dependencies
   - New Python dependencies:
     - Dependency1
     - Dependency2
     - ...
   - New dependencies and their licenses:
     - Dependency1 and license1
     - Dependency2 and license2
     - ...
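
Since the description above is still the unfilled template, a brief note on the "groupwise" (gw) in the title: groupwise quantization splits each weight row into fixed-size groups along the input dimension and gives every group its own scale, which bounds quantization error per group. Below is a hedged numpy sketch of symmetric int4 groupwise quantization; the group size of 128 and the max-abs scaling are illustrative assumptions, not details taken from this PR:

```python
import numpy as np

def quantize_groupwise_int4(weight: np.ndarray, group_size: int = 128):
    """Symmetric per-group int4 quantization of a (out, in) weight matrix."""
    out_f, in_f = weight.shape
    assert in_f % group_size == 0
    w = weight.reshape(out_f, in_f // group_size, group_size)
    # One scale per (output row, group): max-abs / 7 keeps values in [-7, 7].
    scales = np.clip(np.abs(w).max(axis=-1, keepdims=True) / 7.0, 1e-8, None)
    q = np.clip(np.round(w / scales), -8, 7).astype(np.int8)
    return q.reshape(out_f, in_f), scales.squeeze(-1)

def dequantize(q, scales, group_size: int = 128):
    out_f, in_f = q.shape
    w = q.reshape(out_f, in_f // group_size, group_size).astype(np.float32)
    return (w * scales[..., None]).reshape(out_f, in_f)

# Round-trip example: per-element error is at most half a quantization step.
w = np.random.randn(16, 256).astype(np.float32)
q, s = quantize_groupwise_int4(w)
assert np.abs(dequantize(q, s) - w).max() <= s.max() / 2 + 1e-6
```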