Extend weight compression with INT8 symmetric scheme #2288
Codecov Report
Additional details and impacted files
@@             Coverage Diff             @@
##           develop    #2288      +/-   ##
===========================================
- Coverage    90.76%   85.96%    -4.80%
===========================================
  Files          486      487        +1
  Lines        43764    43805       +41
===========================================
- Hits         39723    37659     -2064
- Misses        4041     6146     +2105
ee3511e to bf5c8f9
nncf/quantization/algorithms/weight_compression/openvino_backend.py
Special thanks for the refactoring and for fixing the issue with printing statistics.
INT4_SYM = "int4_sym"
INT4_ASYM = "int4_asym"
NF4 = "nf4"
INT8 = "int8"  # Deprecated mode
What do you think about leaving INT8 as an alias for INT8_ASYM?
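One way the alias suggestion could look, as a minimal sketch rather than the code in this PR: in a Python `Enum`, a member that reuses an existing value becomes an alias of the first member with that value. The class name `CompressionMode` and the values `"int8_sym"` and `"int8_asym"` are assumptions made for illustration; only the `INT4_SYM`, `INT4_ASYM`, `NF4` and `INT8` lines appear in the diff above.

```python
from enum import Enum


class CompressionMode(Enum):
    """Hypothetical stand-in for the enum under review."""

    INT8_SYM = "int8_sym"    # assumed value, following the "int4_sym" pattern
    INT8_ASYM = "int8_asym"  # assumed value
    INT4_SYM = "int4_sym"
    INT4_ASYM = "int4_asym"
    NF4 = "nf4"
    # Reusing an existing value makes INT8 an alias, not a new member.
    INT8 = "int8_asym"

    @classmethod
    def _missing_(cls, value):
        # Keep the legacy string "int8" working for lookups by value.
        if value == "int8":
            return cls.INT8_ASYM
        return None


print(CompressionMode.INT8 is CompressionMode.INT8_ASYM)  # identity check
print([mode.name for mode in CompressionMode])            # aliases are skipped
```

With this layout, existing callers that pass `INT8` keep working, iteration over the enum does not list the alias, and the old string value still resolves through `_missing_`.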
Changes
Added the INT8_SYM compression mode.
Reason for changes
The INT8_SYM mode can provide better performance and is required for dynamic quantization.
Related tickets
124823
Tests
Updated tests/openvino/native/quantization/test_weights_compression.py
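For readers unfamiliar with the distinction behind INT8_SYM and the asymmetric mode, the following is a generic textbook sketch of the two 8-bit schemes, not the NNCF implementation. The function names, the per-tensor granularity and the choice of 127 and 255 levels are all assumptions made for illustration.

```python
def quantize_sym(weights, levels=127):
    """Symmetric scheme: signed integers, zero point fixed at 0."""
    scale = max(abs(w) for w in weights) / levels or 1.0
    q = [max(-128, min(127, round(w / scale))) for w in weights]
    return q, scale


def dequantize_sym(q, scale):
    return [v * scale for v in q]


def quantize_asym(weights, levels=255):
    """Asymmetric scheme: unsigned integers plus a stored zero point."""
    lo = min(min(weights), 0.0)  # keep 0.0 inside the representable range
    hi = max(max(weights), 0.0)
    scale = (hi - lo) / levels or 1.0
    zero_point = round(-lo / scale)
    q = [max(0, min(255, round(w / scale) + zero_point)) for w in weights]
    return q, scale, zero_point


def dequantize_asym(q, scale, zero_point):
    return [(v - zero_point) * scale for v in q]


weights = [-1.0, -0.5, 0.0, 0.25, 2.0]
q_s, scale_s = quantize_sym(weights)
q_a, scale_a, zp = quantize_asym(weights)
print(q_s, scale_s)
print(q_a, scale_a, zp)
```

The symmetric variant stores only a scale, so dequantization is a single multiplication. The asymmetric variant also carries a zero point, which costs an extra subtraction but uses the full integer range when the weights are not centered on zero.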