[Torch] Serialize and load NNCF transformations #2531
Codecov Report

Attention: Patch coverage is

Additional details and impacted files:

```
@@            Coverage Diff             @@
##           develop    #2531       +/-   ##
============================================
- Coverage    91.20%   29.88%    -61.32%
============================================
  Files          493      495         +2
  Lines        45519    45961       +442
============================================
- Hits         41514    13734     -27780
- Misses        4005    32227     +28222
```

... and 323 files with indirect coverage changes
Preparation for #2531

### Changes
1. `PTQuantizerInsertionCommand` is removed and replaced with the `create_quantizer_insertion_command` function.
2. `SharedFNInsertionCommand` gains one new attribute: `compression_module_type`.
3. `ExtraOpCallHook` no longer requires a context in its constructor.
4. Multidevice support is moved from `apply_quantizers_insertion_commands_transformation` to `apply_insertion_transformation`.

### Reason for changes
1. To make commands easier to store and restore: fewer commands means fewer adapters are needed.
2. To make it possible to express `PTQuantizerInsertionCommand` through `SharedFNInsertionCommand`.
3. To make it possible to create `ExtraOpCallHook` outside of `PTModelTransformer`.
4. To unify multidevice support for all insertion operations.

### Related tickets
2531

### Tests
1. `test_quantizer_insertion_transformation` is updated.
2. -
3. `test_shared_fn_insertion_point` is updated.
4. `test_pt_insertion_command` is introduced.
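The idea of expressing a quantizer insertion through the more general shared-fn command can be sketched as follows. This is a minimal illustrative sketch, not the actual NNCF API: the class fields, the `EXTERNAL_QUANTIZER` value, and the factory signature are assumptions for illustration only.

```python
from dataclasses import dataclass
from typing import Any, Callable, List

# Hypothetical stand-in for NNCF's SharedFNInsertionCommand; the field
# names here are illustrative, not the real class definition.
@dataclass
class SharedFNInsertionCommand:
    target_point_names: List[str]      # where the shared hook is attached
    fn: Callable[[Any], Any]           # the compression module to call
    op_unique_name: str
    compression_module_type: str       # the new attribute described above

def create_quantizer_insertion_command(
    target_point: str, quantizer: Callable[[Any], Any]
) -> SharedFNInsertionCommand:
    """Express a quantizer insertion as a shared-fn insertion (sketch)."""
    return SharedFNInsertionCommand(
        target_point_names=[target_point],
        fn=quantizer,
        op_unique_name=f"quantizer_{target_point}",
        compression_module_type="EXTERNAL_QUANTIZER",  # hypothetical value
    )

cmd = create_quantizer_insertion_command("conv1/weight", lambda x: x)
print(cmd.compression_module_type)  # EXTERNAL_QUANTIZER
```

With a single general command type, one serialization adapter covers both plain hook insertions and quantizer insertions, which is the point of change 1 above.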
API code moved to a separate PR
nncf/torch/layer_utils.py (Outdated)

```python
    """

    @abstractmethod
    def get_state(self) -> Dict[str, Any]:
```
What do you think about `get_config` / `from_config`, to align with the high-level API?
Done
LGTM
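The `get_config`/`from_config` convention agreed on above can be sketched with a toy compression module. The method names are from the review; the class body and its fields are hypothetical, shown only to illustrate the round-trip contract.

```python
from typing import Any, Dict

class ToyQuantizer:
    """Toy compression module following the get_config/from_config
    convention (the class itself is a hypothetical example, not NNCF code)."""

    def __init__(self, num_bits: int = 8, scale: float = 1.0) -> None:
        self.num_bits = num_bits
        self.scale = scale

    def get_config(self) -> Dict[str, Any]:
        # Everything returned here must be JSON-serializable so the
        # whole transformation layout can be dumped to a file.
        return {"num_bits": self.num_bits, "scale": self.scale}

    @classmethod
    def from_config(cls, config: Dict[str, Any]) -> "ToyQuantizer":
        # Rebuild an equivalent module purely from the config dict.
        return cls(**config)

q = ToyQuantizer(num_bits=4, scale=0.5)
restored = ToyQuantizer.from_config(q.get_config())
print(restored.num_bits, restored.scale)  # 4 0.5
```

The key design point is symmetry: `from_config(get_config())` must produce a module equivalent to the original, which is what makes whole-layout serialization possible.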
### Changes
The `_compression_lr_multiplier` attribute introduced in #2531 is removed from `CompressionParameter`.

### Reason for changes
`_compression_lr_multiplier` makes `CompressionParameter` a stateful parameter, which for some reason does not work properly in distributed/dataparallel mode.

### Tests
torch_nightly/213/ - finished successfully
On top of #2584 #2595

### Changes
`serialize_transformations` and `load_transformations` functions are introduced: `serialize_transformations` serializes a `PTTransformationLayout` to a dict that can be serialized by json; the serialized transformations can be recovered by the `load_transformations` function. `StatefullTorchModuleInterface` is introduced to make it possible to serialize all compression modules for the quantization, sparsification, weights compression, and pruning algorithms.

### Reason for changes
`nncf.quantize` and quantizers initialization

### Related tickets
129586

### Tests
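The serialize/load round trip described above can be sketched generically. This is not the PR's implementation: the registry, the command class, and `get_state`/`from_state` names are assumptions used only to show how a transformation layout can be reduced to a JSON string and recovered.

```python
import json
from typing import Any, Dict, List

# Hypothetical registry mapping a command type name to its class, so
# load_transformations knows which class to rebuild from each entry.
COMMAND_REGISTRY: Dict[str, Any] = {}

def register(name: str):
    def wrap(cls):
        COMMAND_REGISTRY[name] = cls
        cls.command_name = name
        return cls
    return wrap

@register("insert_quantizer")
class InsertQuantizerCommand:
    """Toy transformation command (illustrative, not NNCF's class)."""
    def __init__(self, target: str, num_bits: int) -> None:
        self.target, self.num_bits = target, num_bits

    def get_state(self) -> Dict[str, Any]:
        return {"target": self.target, "num_bits": self.num_bits}

    @classmethod
    def from_state(cls, state: Dict[str, Any]) -> "InsertQuantizerCommand":
        return cls(**state)

def serialize_transformations(commands: List[Any]) -> str:
    # Each command becomes a {type, state} entry; the result is plain JSON.
    return json.dumps(
        [{"type": c.command_name, "state": c.get_state()} for c in commands]
    )

def load_transformations(payload: str) -> List[Any]:
    # Look up each command class by its type name and rebuild it.
    return [
        COMMAND_REGISTRY[e["type"]].from_state(e["state"])
        for e in json.loads(payload)
    ]

cmds = [InsertQuantizerCommand("conv1", 8)]
restored = load_transformations(serialize_transformations(cmds))
print(restored[0].target, restored[0].num_bits)  # conv1 8
```

Keeping each command's state JSON-serializable is what lets the whole layout be written to disk next to a checkpoint and reapplied to a fresh model later.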