
Fix GPTQ/RTN 3.x example & fix asym quantize #1611

Merged
merged 18 commits into master on Feb 21, 2024
Conversation

Kaihui-intel (Contributor) commented Feb 18, 2024

Type of Change

bug fix

Description

Asym quantization was missing the zero-point addition before clamping; the fixed sequence is:

    weight.div_(scale)
    weight.round_()
 +  weight.add_(zp)
    weight.clamp_(0, maxq)

which is equivalent to

q = torch.clamp(torch.round(weight / scale) + zp, 0, maxq)
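The fixed sequence can be sketched as a standalone helper (a minimal illustration; `asym_quantize` and the sample values below are not part of the library's API):

```python
import torch

def asym_quantize(weight: torch.Tensor, scale: float, zp: float, maxq: int) -> torch.Tensor:
    """Asymmetric quantization: q = clamp(round(w / s) + zp, 0, maxq)."""
    weight = weight.clone()  # avoid mutating the caller's tensor
    weight.div_(scale)
    weight.round_()
    weight.add_(zp)          # the step restored by this PR
    weight.clamp_(0, maxq)
    return weight

w = torch.tensor([-1.0, 0.5, 2.0])
q = asym_quantize(w, scale=0.5, zp=2.0, maxq=3)
# matches the one-liner: torch.clamp(torch.round(w / 0.5) + 2.0, 0, 3)
```

Without the `add_(zp)` step, negative weights would be clamped to 0 before the zero point could shift them into the valid `[0, maxq]` range, silently destroying information.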

Expected Behavior & Potential Risk

The expected behavior triggered by this PR.

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

@Kaihui-intel Kaihui-intel added INC3.X PyTorch Related to PyTorch F/W labels Feb 18, 2024
@Kaihui-intel Kaihui-intel changed the title Fix GPTQ/RTN 3.x example Fix GPTQ/RTN 3.x example & fix asym quantize Feb 20, 2024
@Kaihui-intel Kaihui-intel merged commit 813d930 into master Feb 21, 2024
22 checks passed
@Kaihui-intel Kaihui-intel deleted the kaihui/gptq_eg branch February 21, 2024 05:03
Labels
INC3.X PyTorch Related to PyTorch F/W

3 participants