Feature (quantizer): Adding weight normalization-based integer quantization #559

i-colbert · 2023-03-20T16:07:44Z

Adds an experimental narrow, per-channel, signed integer weight quantizer built on weight normalization, following "Quantized Neural Networks for Low-Precision Accumulation with Guaranteed Overflow Avoidance" by I. Colbert, A. Pappalardo, and J. Petri-Koenig. Both L1 and L2 weight normalization are supported.

The quantizer uses the decoupled rescaling integer quantization arithmetic: the weight normalization statistics (d_w) and the norm vector parameterization (g) are combined with the scaling factor to form the pre-clipping scaling factor (i.e., pre_scale), while the conventional scaling factor (s) serves as the post-clipping scaling factor (i.e., post_scale). For further details on the arithmetic, see ParameterPreScalingWeightNorm. For further details on the weight normalization-based quantization technique, see the referenced paper.
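As a rough illustration of the arithmetic described above, here is a minimal pure-Python sketch for a single output channel. It is not the Brevitas implementation. The function name `quantize_channel` is hypothetical. The specific combination pre_scale = s * d_w / g, the use of round-toward-zero, and the symmetric narrow range are assumptions chosen to match the description, so check ParameterPreScalingWeightNorm for the actual behavior.

```python
import math


def quantize_channel(weights, g, s, bit_width=8, p=1):
    """Sketch of decoupled rescaling for one output channel (hypothetical helper).

    weights: float weights of one channel
    g: norm parameter for this channel (learned in practice, fixed here)
    s: conventional scaling factor, used as post_scale
    p: 1 for L1 weight normalization, 2 for L2
    """
    # d_w: weight normalization statistic (p-norm of the channel).
    d_w = sum(abs(w) ** p for w in weights) ** (1.0 / p)

    # Assumed combination: the pre-clipping scale folds d_w, g, and s together.
    pre_scale = s * d_w / g
    post_scale = s

    # Narrow signed range, e.g. [-127, 127] for 8 bits.
    q_max = 2 ** (bit_width - 1) - 1
    q_min = -q_max

    # Assumption: round toward zero, then clip to the integer range.
    ints = [max(q_min, min(q_max, math.trunc(w / pre_scale))) for w in weights]

    # Dequantize with the post-clipping scale only.
    dequant = [q * post_scale for q in ints]
    return ints, dequant


ints, dequant = quantize_channel([0.5, -0.25, 0.125, -0.125], g=2.0, s=0.125)
print(ints)
print(dequant)
```

With L1 normalization and round-toward-zero, the sum of the absolute integer weights in this sketch cannot exceed g / s (as long as no value is clipped), which is the kind of bound the referenced paper relies on to limit accumulator growth.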

i-colbert marked this pull request as ready for review March 21, 2023 22:07

i-colbert changed the title ~~Feature (quantizer): Adding WeightNormIntQuant~~ Feature (quantizer): Adding weight normalization-based integer quantization Mar 22, 2023

volcacius self-requested a review March 23, 2023 16:03

volcacius force-pushed the dev branch from fd73313 to 10edc24 Compare March 24, 2023 10:19

i-colbert force-pushed the icolbert/wniq branch from a007f71 to 64dec7c Compare March 24, 2023 17:55

i-colbert added 16 commits March 24, 2023 11:01

Adding weight normalization-based integer quantizer

4991a0d

Pre-commit fixes

7b3f691

Adding variable for p-norm

d84b52b

Adding SingleArgStatelessBuffer

b4ad793

Adding L1Norm as scaling_stats_impl for weight normalization

119bcb3

Adding L2Norm for normalize_stats_impl

27da048

Adding ParameterPreScalingWeightNorm

8251396

Adding modules to top-level imports

f740c09

Removing WeightNormIntQuant

96045e7

Updating list of modules in pre_scaling.py

70ecfa0

Adding WeightNormPerChannelFloatDecoupled

f82db6d

Adding Int8WeightNormL2PerChannelFixedPoint injector

3276d79

Fixing L2Norm initialization

6729147

Typo fix

fbbf543

Pre-commit fixes

8609858

Adding quant_decoupled to WBIOL weight quantizer tests

6aefeef

i-colbert force-pushed the icolbert/wniq branch from 64dec7c to 6aefeef Compare March 24, 2023 18:01

volcacius approved these changes Mar 24, 2023

volcacius merged commit 735b183 into Xilinx:dev Mar 24, 2023

i-colbert deleted the icolbert/wniq branch March 24, 2023 18:17
