[OV] Add --all-layers argument to CLI #713

nikita-savelyevv · 2024-05-16T09:14:05Z

What does this PR do?

Add --all-layers quantization argument to openvino export CLI interface.

Before submitting

This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
Did you make sure to update the documentation with your changes?
Did you write any new necessary tests?

tests/openvino/test_exporters_cli.py

HuggingFaceDocBuilderDev · 2024-05-16T09:21:26Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

helena-intel · 2024-05-16T10:48:11Z

optimum/commands/export/openvino.py

+        action="store_true",
+        default=None,
+        help=(
+            "Whether embeddings and last MatMul layers should be compressed to a primary precision (usually, INT4)."


Is there a non-INT4 usecase? I see below that if it is provided, all_layers is set to None if is_int8 else self.args.all_layers so it seems like it's ignored for INT8? If it is only for INT4 it would be good to clarify that in the help message. And possibly make it mutually exclusive with INT8.

This is for INT4 only.

Thanks for your comment! By default we compress those layers to INT8, even if number of bits is set to 4. This flag allows to compress those layer to INT4 as well. I've updated the description, hopefully it is more clear now (since currently the only supported primary precision is INT4, I've rephrased it a bit).

echarlaix

LGTM, thanks @nikita-savelyevv

Add --all-layers argument to CLI

2b424b0

nikita-savelyevv commented May 16, 2024

View reviewed changes

tests/openvino/test_exporters_cli.py Show resolved Hide resolved

helena-intel reviewed May 16, 2024

View reviewed changes

Update description

6bb1330

nikita-savelyevv requested review from AlexKoff88 and helena-intel May 17, 2024 13:12

AlexKoff88 approved these changes May 17, 2024

View reviewed changes

echarlaix approved these changes May 17, 2024

View reviewed changes

echarlaix merged commit bc5051f into huggingface:main May 17, 2024
11 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[OV] Add --all-layers argument to CLI #713

[OV] Add --all-layers argument to CLI #713

nikita-savelyevv commented May 16, 2024

HuggingFaceDocBuilderDev commented May 16, 2024

helena-intel May 16, 2024

AlexKoff88 May 16, 2024

nikita-savelyevv May 16, 2024 •

edited

Loading

echarlaix left a comment

[OV] Add --all-layers argument to CLI #713

[OV] Add --all-layers argument to CLI #713

Conversation

nikita-savelyevv commented May 16, 2024

What does this PR do?

Before submitting

HuggingFaceDocBuilderDev commented May 16, 2024

helena-intel May 16, 2024

Choose a reason for hiding this comment

AlexKoff88 May 16, 2024

Choose a reason for hiding this comment

nikita-savelyevv May 16, 2024 • edited Loading

Choose a reason for hiding this comment

echarlaix left a comment

Choose a reason for hiding this comment

nikita-savelyevv May 16, 2024 •

edited

Loading