Add disable optimization params for awq #12641

cyita · 2025-01-02T06:35:25Z

Description

Add disable_optimize_pre in from_pretrained to disable merge qkv for AWQ.
Add disable_fp16_opt in FP16Linear to disable transpose weight for AWQ.

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

N/A
Unit test: Please manually trigger the PR Validation here by inputting the PR number (e.g., 1234). And paste your action link here once it has been successfully finished.
Application test
Document test
...

5. New dependencies

New Python dependencies
- Dependency1
- Dependency2
- ...
New Java/Scala dependencies and their license
- Dependency1 and license1
- Dependency2 and license2
- ...

rnwang04 · 2025-01-02T06:58:12Z

python/llm/src/ipex_llm/transformers/low_bit_linear.py

@@ -764,6 +764,7 @@ def __init__(self, input_features, output_features, bias=True,
        # weigh_type = 3 means weight has been transposed by esimd method
        self.weight_type = 1
        self.optimize_lm_head = optimize_lm_head
+        self.disable_fp16_opt = False


I wonder where did you set it to True for your use case ?

On the AWQ side.

rnwang04

LGTM

cyita · 2025-01-02T07:36:08Z

PR Validation: https://github.com/intel-analytics/ipex-llm-workflow/actions/runs/12578787560
Error due to outdated falcon model.

cyita added 2 commits January 2, 2025 14:31

add disable opts for awq

1bf9420

Merge remote-tracking branch 'upstream/main' into awq-support

24d2604

cyita requested a review from rnwang04 January 2, 2025 06:40

rnwang04 reviewed Jan 2, 2025

View reviewed changes

rnwang04 approved these changes Jan 2, 2025

View reviewed changes

cyita merged commit 8e5328e into intel:main Jan 2, 2025
1 check passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add disable optimization params for awq #12641

Add disable optimization params for awq #12641

cyita commented Jan 2, 2025

rnwang04 Jan 2, 2025

cyita Jan 2, 2025

rnwang04 left a comment

cyita commented Jan 2, 2025

Add disable optimization params for awq #12641

Add disable optimization params for awq #12641

Conversation

cyita commented Jan 2, 2025

Description

1. Why the change?

2. User API changes

3. Summary of the change

4. How to test?

5. New dependencies

rnwang04 Jan 2, 2025

Choose a reason for hiding this comment

cyita Jan 2, 2025

Choose a reason for hiding this comment

rnwang04 left a comment

Choose a reason for hiding this comment

cyita commented Jan 2, 2025