[Compression] Add bias correction feature for PTQ quantizer #5603
Conversation
Could you briefly explain what the bias correction feature is in this PR's description?
Bias correction in post-training quantization refers to the process of adjusting the quantized model's weights and biases to reduce the discrepancies between the quantized model's predictions and the original full-precision model's predictions. Post-training quantization is a technique used to reduce the memory and computation requirements of deep learning models by converting the model's parameters, such as weights and biases, into lower-precision representations (e.g., from 32-bit floating-point numbers to 8-bit integers). This can lead to some loss of accuracy due to the reduced numerical precision. Bias correction aims to mitigate this accuracy loss by correcting the systematic errors introduced during the quantization process, ultimately improving the overall performance of the quantized model.
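To make the idea concrete, here is a minimal NumPy sketch of per-channel bias correction for a single linear layer. This is an illustration of the general technique, not NNI's actual implementation; the function names (`fake_quantize`, `correct_bias`) are hypothetical. The key identity is that quantizing the weights shifts the mean pre-activation by `(W - Q(W)) E[x]`, so adding that term back into the bias cancels the systematic error on the calibration data.

```python
import numpy as np

def fake_quantize(w, num_bits=8):
    """Symmetric uniform quantize-dequantize of a weight tensor."""
    qmax = 2 ** (num_bits - 1) - 1
    scale = np.abs(w).max() / qmax
    return np.round(w / scale) * scale

def correct_bias(weight, bias, calib_inputs, num_bits=8):
    """Shift the bias to cancel the mean output error from quantization.

    E[W x + b] - E[Q(W) x + b] = (W - Q(W)) E[x], so adding that term
    to the bias restores the pre-activation mean on the calibration set.
    """
    q_weight = fake_quantize(weight, num_bits)
    mean_input = calib_inputs.mean(axis=0)      # E[x], shape (in_features,)
    error = (weight - q_weight) @ mean_input    # per-output-channel mean error
    return bias + error

# Usage: after correction, the quantized layer's mean output matches FP32.
rng = np.random.default_rng(0)
W = rng.normal(size=(4, 16))
b = rng.normal(size=4)
x = rng.normal(size=(256, 16))                  # calibration batch
b_corr = correct_bias(W, b, x)
fp_mean = (x @ W.T + b).mean(axis=0)
q_mean = (x @ fake_quantize(W).T + b_corr).mean(axis=0)
assert np.allclose(fp_mean, q_mean, atol=1e-6)
```

Note that this only corrects the first moment of the output distribution; the remaining per-sample quantization noise is unaffected, which is why bias correction is typically combined with good range calibration rather than used alone.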