
Transition to a newer NNCF API for PyTorch model quantization #630

Merged: 3 commits into huggingface:main on Mar 26, 2024

Conversation

nikita-savelyevv (Collaborator)

Post-training quantization for the PyTorch backend via the nncf.create_compressed_model() API is obsolete and should be replaced with a nncf.quantize() call, which is what is already used for the OV backend.

What does this PR do?

Replace the nncf.create_compressed_model() call with a nncf.quantize() call for the quantization of PyTorch models, as sketched below.
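
For illustration, a minimal sketch of what this migration looks like on the user side of NNCF (the model, dataloader, and transform function are placeholders, not code from this PR):

```python
import nncf  # NNCF post-training quantization API

# Placeholders (assumptions for illustration): any torch.nn.Module and a
# torch.utils.data.DataLoader yielding (inputs, labels) batches.
model = ...
calibration_loader = ...

def transform_fn(batch):
    # Map a dataloader batch to the input format the model expects.
    inputs, _labels = batch
    return inputs

# Legacy flow (deprecated for post-training quantization):
#   from nncf.torch import create_compressed_model
#   ctrl, quantized_model = create_compressed_model(model, nncf_config)

# New flow used by this PR: nncf.quantize() with a calibration dataset,
# matching what is already done for the OpenVINO backend.
calibration_dataset = nncf.Dataset(calibration_loader, transform_fn)
quantized_model = nncf.quantize(model, calibration_dataset)
```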

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?

nikita-savelyevv changed the title from "[NNCF] Replace create_compressed_model call with quantize call for PyTorch backend" to "Transition to a newer NNCF API for PyTorch model quantization" on Mar 22, 2024
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

nikita-savelyevv marked this pull request as ready for review on March 25, 2024 at 08:12
nikita-savelyevv (Collaborator, Author) commented Mar 25, 2024

@AlexKoff88 @alexsu52 could you please review this PR?

AlexKoff88 requested a review from echarlaix on March 25, 2024 at 10:10
AlexKoff88 (Collaborator)

@echarlaix, as we agreed, this is the first PR in a series of changes to deprecate legacy NNCF functions and move all of the actual optimization functionality under the OVQuantizer API. Please take a look.

echarlaix (Collaborator) left a comment

Looks good, thanks @nikita-savelyevv

```diff
@@ -360,7 +358,7 @@ def _quantize_torchmodel(
     logger.info(
         "No configuration describing the quantization process was provided, a default OVConfig will be generated."
     )
-    ov_config = OVConfig(compression=DEFAULT_QUANTIZATION_CONFIG)
+    ov_config = OVConfig()
```
echarlaix (Collaborator):

This is not used for quantization anymore (only for the save_onnx_model parameter), so I'm not sure we need to create an instance when none is provided; we could instead give save_onnx_model a default value. For the same reason, there is also no need to save the configuration after the quantization + export steps: I would remove saving the config here

```python
ov_config.save_pretrained(save_directory)
```
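
A hypothetical sketch of the suggested simplification (the signature and body below are illustrative assumptions, not the actual optimum-intel code):

```python
import nncf

# Sketch only: save_onnx_model becomes a plain keyword argument with a
# default value, so no fallback OVConfig instance is needed.
def _quantize_torchmodel(model, calibration_dataset, save_directory, save_onnx_model=False):
    quantized_model = nncf.quantize(model, calibration_dataset)
    # ... export quantized_model to OpenVINO IR (and also to ONNX when
    # save_onnx_model is True) under save_directory ...
    # No ov_config.save_pretrained(save_directory) afterwards: the config
    # no longer describes the quantization that was performed.
    return quantized_model
```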

nikita-savelyevv (Collaborator, Author):

Made the suggested changes, and also removed ov_config from the list of arguments altogether. In the future it will be brought back to pass quantization parameters through it.

echarlaix merged commit a3bf172 into huggingface:main on Mar 26, 2024
9 of 10 checks passed