add attr to MatMulNBits #1378

mengniwang95 · 2023-11-07T08:46:33Z

Type of Change

feature

Description

Add the attribute "accuracy_level" to MatMulNBits so it fits our jblas kernel.

The default is 0, which selects the original mlas kernel.
1 means the jblas compute type is fp32.
2 means the jblas compute type is fp16.
3 means the jblas compute type is bf16.
4 means the jblas compute type is int8.
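
To make the value mapping above concrete, here is a minimal sketch of how the attribute could be validated and bundled with the other node attributes. The names `AccuracyLevel` and `matmul_nbits_attrs` are hypothetical and not taken from this PR. The attribute names `K`, `N`, `bits` and `block_size` are assumed to follow the ONNX Runtime MatMulNBits contrib op, and the defaults for `bits` and `block_size` are illustrative only.

```python
from enum import IntEnum


class AccuracyLevel(IntEnum):
    """Hypothetical names for the accuracy_level values listed in the description."""

    MLAS_DEFAULT = 0  # original mlas kernel
    JBLAS_FP32 = 1    # jblas compute type fp32
    JBLAS_FP16 = 2    # jblas compute type fp16
    JBLAS_BF16 = 3    # jblas compute type bf16
    JBLAS_INT8 = 4    # jblas compute type int8


def matmul_nbits_attrs(k, n, bits=4, block_size=32, accuracy_level=0):
    """Build an attribute dict for a MatMulNBits node (illustrative helper).

    Raises ValueError if accuracy_level is not one of the documented values 0..4.
    """
    level = AccuracyLevel(accuracy_level)
    return {
        "K": k,
        "N": n,
        "bits": bits,
        "block_size": block_size,
        "accuracy_level": int(level),
    }


if __name__ == "__main__":
    # Request the int8 jblas compute type for a 4096 x 4096 weight.
    print(matmul_nbits_attrs(4096, 4096, accuracy_level=AccuracyLevel.JBLAS_INT8))
```

A dict like this could then be passed as keyword arguments when the node is created, for example through `onnx.helper.make_node`, which treats extra keyword arguments as node attributes. Whether the attribute is written when the level is 0 is a design choice this sketch does not settle.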

Expected Behavior & Potential Risk

the expected behavior triggered by this PR

How has this PR been tested?

how to reproduce the test (including hardware information)

Dependency Change?

any library dependency introduced or removed

Signed-off-by: Mengni Wang <[email protected]>

chensuyue · 2023-11-14T06:59:50Z

@yuwenzho please review.

chensuyue · 2023-11-14T07:00:12Z

/azp run Model-Test

azure-pipelines · 2023-11-14T07:00:42Z

Azure Pipelines successfully started running 1 pipeline(s).

neural_compressor/adaptor/ox_utils/weight_only.py

mengniwang95 added 4 commits November 2, 2023 16:24

add attr to MatMulNBits

9887c6c

Signed-off-by: Mengni Wang <[email protected]>

Update onnxrt.py

27fbe7c

Update weight_only.py

e18fc75

Update weight_only.py

060a061

mengniwang95 requested a review from yuwenzho November 10, 2023 03:12

mengniwang95 and others added 2 commits November 10, 2023 14:15

Merge branch 'master' into mengni/4bit

28fa351

Merge branch 'master' into mengni/4bit

e550449

yuwenzho reviewed Nov 14, 2023

neural_compressor/adaptor/ox_utils/weight_only.py (outdated, resolved)

chensuyue added the "enhancement" (New feature or request) label Nov 16, 2023

mengniwang95 added 3 commits November 16, 2023 20:20

Update weight_only.py

b47814b

Update weight_only.py

22a7697

Merge branch 'master' into mengni/4bit

56cb425

chensuyue merged commit 7057e3b into master Nov 17, 2023
45 of 47 checks passed

chensuyue deleted the mengni/4bit branch November 17, 2023 02:33
