Temporary fix for QAT quantizer when linear layer bias is True #1087
Conversation
🔗 Helpful links: 🧪 see artifacts and rendered test results at hud.pytorch.org/pr/pytorch/ao/1087. Note: links to docs will display an error until the docs builds have completed.
✅ No failures as of commit b6982c0 with merge base 6ea36c5. This comment was automatically generated by Dr. CI and updates every 15 minutes.
Hi @elfisworking, thanks for fixing this. Changes look great! Would you mind adding a test in
@@ -617,7 +617,7 @@ def _replace_linear_int4(
     copy_weights: bool = False,
 ):
     for name, child in module.named_children():
-        if isinstance(child, nn.Linear) and (skip_layer_func is None or not skip_layer_func(child.weight)):
+        if isinstance(child, nn.Linear) and child.bias is None and (skip_layer_func is None or not skip_layer_func(child.weight)):
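For context, a minimal sketch of the skip logic this diff introduces: the replacement walk now leaves any nn.Linear that carries a bias untouched, since the int4 replacement path does not support bias yet. The helper name `make_quantized_linear` below is a hypothetical placeholder for illustration, not the actual torchao replacement module.

```python
import torch.nn as nn

def _replace_linear_skip_bias(module: nn.Module, make_quantized_linear, skip_layer_func=None):
    """Illustrative sketch only: replace bias-free nn.Linear children, skip the rest.

    `make_quantized_linear` is a hypothetical factory standing in for the real
    int4 replacement module used in torchao.
    """
    for name, child in module.named_children():
        is_supported_linear = isinstance(child, nn.Linear) and child.bias is None
        not_skipped = skip_layer_func is None or not skip_layer_func(child.weight)
        if is_supported_linear and not_skipped:
            # Swap in the quantized replacement only for layers the int4 path can handle.
            setattr(module, name, make_quantized_linear(child))
        else:
            # Layers with bias (e.g. Qwen-style projections) are left as-is for now;
            # recurse to keep walking the rest of the module tree.
            _replace_linear_skip_bias(child, make_quantized_linear, skip_layer_func)
```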
Can you also add a TODO: support linear bias (here and at L982)?
Signed-off-by: yumin <[email protected]>
Test has been added.
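For reference, a regression test for this fix could look roughly like the sketch below. This is a hedged sketch, not the test added in the PR: the import path and the `prepare()` entry point mirror torchao's QAT API but should be treated as assumptions here, and the check is only that a bias-carrying linear survives preparation and a forward pass without error.

```python
import torch
import torch.nn as nn
from torchao.quantization.prototype.qat import Int8DynActInt4WeightQATQuantizer

def test_qat_prepare_linear_with_bias():
    # Minimal model whose linear layer has bias=True, as in Qwen-style blocks.
    model = nn.Sequential(nn.Linear(256, 256, bias=True))
    quantizer = Int8DynActInt4WeightQATQuantizer()
    prepared = quantizer.prepare(model)
    # Preparation should not crash, and the model should still run forward.
    out = prepared(torch.randn(2, 256))
    assert out.shape == (2, 256)
```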
Thanks @elfisworking, merging this.
Linked issue: Int8DynActInt4WeightQATQuantizer doesn't support Qwen series #1080