[Good First Issue][NNCF]: Dump actual_subset_size to ov.Model #2562

l-bat · 2024-03-08T11:55:14Z

Context

After applying quantization to the ov.Model in Neural Network Compression Framework (NNCF), the quantization parameters, including subset_size, are dumped to the meta section of the OpenVINO IR. subset_size represents the size of the dataset used for calibration.

nncf/nncf/openvino/quantization/quantize_model.py

Line 102 in 09960b9

"subset_size": subset_size,

However, inconsistencies arise when the dataset size is less than the provided or default 'subset_size'. To address this confusion, it is proposed to also dump the actual_subset_size, which denotes the number of data samples used to calculate activation statistics. This addition will improve clarity and accuracy in managing quantization parameters and assist in reproducing quantization results.

What needs to be done?

Dump actual_subset_size parameter to ov.Model meta section.
Add tests

Example Pull Requests

No response

Resources

Contribution guide - start here!

Contact points

@l-bat

Ticket

No response

The text was updated successfully, but these errors were encountered:

AiGaf1 · 2024-03-09T14:48:00Z

.take

github-actions · 2024-03-09T14:48:10Z

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

andrey-churkin · 2024-03-21T17:08:40Z

@AiGaf1 Hi, are you still working on this task? Do you need any help? Please inform us if you do not plan to continue working on this task. Thanks!

Dump actual_subset_size to ov.Model (Issue openvinotoolkit#2562)

RitikaxShakya · 2024-03-29T11:18:43Z

Hello! is there any update on this issue? If not i wish to work on this issue.

p-wysocki · 2024-04-03T18:29:01Z

@l-bat could you please reassign the issue to @RitikaxShakya? I lack the permissions for NNCF repository.

RitikaxShakya · 2024-04-04T06:45:48Z

.take

github-actions · 2024-04-04T06:46:00Z

Thanks for being interested in this issue. It looks like this ticket is already assigned to a contributor. Please communicate with the assigned contributor to confirm the status of the issue.

p-wysocki · 2024-05-06T09:43:39Z

Hello @RitikaxShakya, are you still working on that issue? Do you need any help?

awayzjj · 2024-06-27T23:39:00Z

.take

github-actions · 2024-06-27T23:39:12Z

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

awayzjj · 2024-06-29T04:01:24Z

Hi @l-bat I created a PR after testing locally, and the XML output is as expected:

I ran the pytest in tests/openvino, and the original tests did not break. But I have 2 questions

which file I should edit to add my unit test.
how to implement the unit test, should I check the output XML to verify whether the actual_subset_size property exists?

Thank you very much!

l-bat · 2024-07-01T12:04:14Z

Hi @awayzjj!
Thanks for your contribution!

calibration_dataset.get_length() returns the size of the dataset that was provided to the nncf.quantize method, however actual_subset_size should show the number of data samples that were used to calculate the activation statistics.
In the case of calibration_dataset.get_length() >= subset_size, actual_subset_size is equal to subset_size. Otherwise, actual_subset_size must be equal to calibration_dataset.get_length(). But it is not possible to use the get_length() method if __len__() is not implemented. Please take a look at

nncf/nncf/common/tensor_statistics/aggregator.py

Lines 50 to 53 in e8ea252

    
           dataset_length = self.dataset.get_length() 
        
           if dataset_length and self.stat_subset_size: 
        
               return min(dataset_length, self.stat_subset_size) 
        
           return dataset_length or self.stat_subset_size

. You can implement the get_actual_subset_size() function.

l-bat · 2024-07-01T12:08:49Z

I ran the pytest in tests/openvino, and the original tests did not break. But I have 2 questions

which file I should edit to add my unit test.

how to implement the unit test, should I check the output XML to verify whether the actual_subset_size property exists?

You can add test to https://github.com/openvinotoolkit/nncf/blob/e8ea2521663de807d654ae4f375d20c904755061/tests/openvino/native/quantization/test_quantization_pipeline.py

You can use the test as an example

nncf/tests/openvino/native/quantization/test_quantization_pipeline.py

Lines 178 to 199 in e8ea252

    
           def test_ignored_scope_dump(ignored_options, expected_dump, tmp_path): 
        
               ignored_scope_path = ["nncf", "quantization", "ignored_scope"] 
        
               model = WeightsModel().ov_model 
        
               dataset = get_dataset_for_test(model) 
        
               quantize_parameters = { 
        
                   "preset": QuantizationPreset.PERFORMANCE, 
        
                   "target_device": TargetDevice.CPU, 
        
                   "subset_size": 1, 
        
                   "fast_bias_correction": True, 
        
                   "ignored_scope": ignored_options, 
        
               } 
        
               quantized_model = quantize_impl(model, dataset, **quantize_parameters) 
        
               ov.save_model(quantized_model, tmp_path / "ov_model.xml") 
        
               core = ov.Core() 
        
               dumped_model = core.read_model(tmp_path / "ov_model.xml") 
        
               for key, value in expected_dump.items(): 
        
                   rt_path = ignored_scope_path + [key] if key else ignored_scope_path 
        
                   if value: 
        
                       assert dumped_model.get_rt_info(rt_path) == value 
        
                   else: 
        
                       assert dumped_model.has_rt_info(rt_path) is False

awayzjj · 2024-08-26T14:18:24Z

@l-bat Hi, I've been really busy lately, so I've decided to unassign myself for now. I apologize for any inconvenience this may cause.

zina-cs · 2024-09-26T14:28:34Z

.take

github-actions · 2024-09-26T14:28:49Z

Thank you for looking into this issue! Please let us know if you have any questions or require any help.

l-bat added the good first issue Good for newcomers label Mar 8, 2024

alexsu52 added this to Good first issues Mar 8, 2024

github-project-automation bot moved this to Contributors Needed in Good first issues Mar 8, 2024

github-actions bot assigned AiGaf1 Mar 9, 2024

alexsu52 moved this from Contributors Needed to Assigned in Good first issues Mar 11, 2024

AiGaf1 added a commit to AiGaf1/nncf that referenced this issue Mar 22, 2024

Update quantize_model.py

989fd95

Dump actual_subset_size to ov.Model (Issue openvinotoolkit#2562)

alexsu52 assigned RitikaxShakya and unassigned AiGaf1 Apr 18, 2024

RitikaxShakya removed their assignment Jun 20, 2024

alexsu52 moved this from Assigned to Contributors Needed in Good first issues Jun 27, 2024

github-actions bot assigned awayzjj Jun 27, 2024

alexsu52 moved this from Contributors Needed to Assigned in Good first issues Jun 28, 2024

awayzjj mentioned this issue Jun 29, 2024

dump actual_subset_size #2769

Closed

alexsu52 moved this from Assigned to In Review in Good first issues Jul 1, 2024

awayzjj removed their assignment Aug 26, 2024

alexsu52 moved this from In Review to Contributors Needed in Good first issues Aug 29, 2024

github-actions bot assigned zina-cs Sep 26, 2024

zina-cs mentioned this issue Sep 26, 2024

Adding Len checks for subset size test #2993

Closed

alexsu52 moved this from Contributors Needed to Assigned in Good first issues Sep 26, 2024

zina-cs mentioned this issue Sep 27, 2024

Update aggregator.py #2995

Merged

alexsu52 closed this as completed in #2995 Oct 17, 2024

alexsu52 closed this as completed in e6a4752 Oct 17, 2024

github-project-automation bot moved this from Assigned to Closed in Good first issues Oct 17, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Good First Issue][NNCF]: Dump actual_subset_size to ov.Model #2562

[Good First Issue][NNCF]: Dump actual_subset_size to ov.Model #2562

l-bat commented Mar 8, 2024 •

edited

Loading

AiGaf1 commented Mar 9, 2024

github-actions bot commented Mar 9, 2024

andrey-churkin commented Mar 21, 2024

RitikaxShakya commented Mar 29, 2024 •

edited

Loading

p-wysocki commented Apr 3, 2024 •

edited

Loading

RitikaxShakya commented Apr 4, 2024

github-actions bot commented Apr 4, 2024

p-wysocki commented May 6, 2024

awayzjj commented Jun 27, 2024

github-actions bot commented Jun 27, 2024

awayzjj commented Jun 29, 2024

l-bat commented Jul 1, 2024

l-bat commented Jul 1, 2024

awayzjj commented Aug 26, 2024

zina-cs commented Sep 26, 2024

github-actions bot commented Sep 26, 2024

[Good First Issue][NNCF]: Dump actual_subset_size to ov.Model #2562

[Good First Issue][NNCF]: Dump actual_subset_size to ov.Model #2562

Comments

l-bat commented Mar 8, 2024 • edited Loading

Context

What needs to be done?

Example Pull Requests

Resources

Contact points

Ticket

AiGaf1 commented Mar 9, 2024

github-actions bot commented Mar 9, 2024

andrey-churkin commented Mar 21, 2024

RitikaxShakya commented Mar 29, 2024 • edited Loading

p-wysocki commented Apr 3, 2024 • edited Loading

RitikaxShakya commented Apr 4, 2024

github-actions bot commented Apr 4, 2024

p-wysocki commented May 6, 2024

awayzjj commented Jun 27, 2024

github-actions bot commented Jun 27, 2024

awayzjj commented Jun 29, 2024

l-bat commented Jul 1, 2024

l-bat commented Jul 1, 2024

awayzjj commented Aug 26, 2024

zina-cs commented Sep 26, 2024

github-actions bot commented Sep 26, 2024

l-bat commented Mar 8, 2024 •

edited

Loading

RitikaxShakya commented Mar 29, 2024 •

edited

Loading

p-wysocki commented Apr 3, 2024 •

edited

Loading