-
Notifications
You must be signed in to change notification settings - Fork 240
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[Torch] INT4 weight compression #3014
Conversation
7906fa8
to
c1d9fd7
Compare
d4df25c
to
b4d13b8
Compare
) | ||
|
||
if backup_mode is not None: | ||
raise AttributeError("TorchFX backend does not support backup_mode option.") | ||
if ratio is not None and ratio != 1: |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
As far as I can see, Torch and Torch FX have the same processing of parameters. Does it make sense to combine them in a single function?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Thanks for this comment. Thanks for this comment. We are going to refactor it when the torch.fx backend will be ready to move out from experiments module.
cc' @daniil-lyakhov, @anzr299
a6e4a47
to
2632e99
Compare
2632e99
to
9a1432c
Compare
Changes
INT4SymmetricWeightsDecompressor
andINT4ASymmetricWeightsDecompressor
Reason for changes
Support INT4 weight model compression of PyTorch models in NNCF.
Related tickets
#3005
Tests
updated tests