
dump model before compress cli #723

Closed

Conversation

eaidova (Collaborator) commented May 22, 2024

What does this PR do?

Fixes # (issue)

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you make sure to update the documentation with your changes?
  • Did you write any new necessary tests?


@@ -412,11 +414,24 @@ def ts_patched_forward(*args, **kwargs):

    if stateful:
        patch_stateful(model.config, ov_model)
    if ov_config.quantization_config:
Collaborator
there is another PR where we probably solve this problem: https://github.com/huggingface/optimum-intel/pull/721/files

eaidova (Collaborator, Author)
@nikita-savelyevv's PR, as I understand it, works only for quantization with a dataset in the specific cases where the model needs to be inferred. The goal of my changes is to remove the PyTorch model before any weight compression starts, in order to free memory (before it is saved to disk, the IR shares weights with the PyTorch model and may additionally require its own memory on top of that). When we use the API for exporting a model, weight compression happens after the conversion step has finished and we have already removed the PyTorch model from RAM; but when optimum-cli is used, conversion and compression are combined in one step, and the PyTorch model is still alive at the compression step.

I opened it as an experiment only; both changes can be useful, and I think we can combine them.
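For illustration, a minimal sketch of the flow described above (the helper name `convert_and_compress` and the exact call order are assumptions, not the actual optimum-intel code path): convert first, release the PyTorch model, then run weight compression on the OpenVINO model alone.

```python
import gc

import nncf
import openvino as ov


def convert_and_compress(pt_model, example_input, output_path, compress=True):
    # Convert the PyTorch model to an in-memory OpenVINO model.
    ov_model = ov.convert_model(pt_model, example_input=example_input)

    # Before it is saved to disk, the in-memory IR still shares weight
    # buffers with the PyTorch model, so dropping the torch object here
    # avoids keeping both alive during the memory-heavy compression step.
    # (This only helps if no other references to pt_model remain.)
    del pt_model
    gc.collect()

    if compress:
        # Weight compression on the OpenVINO model via NNCF
        # (int8 weight compression by default).
        ov_model = nncf.compress_weights(ov_model)

    ov.save_model(ov_model, output_path)
    return ov_model
```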

Collaborator
Yes, seems like the changes are independent.

@eaidova Just out of curiosity, do you have numbers for how much memory can be saved with this approach?

Collaborator
Thanks, @eaidova. @nikita-savelyevv, can you please adopt the changes from this PR?

Collaborator
@AlexKoff88 I don't think this applies to my case because I don't have access to the PyTorch or OpenVINO model objects after the main_export call. So I can't delete them.

Or do you mean to copy changes from this PR to my PR? If so, in my opinion these should be added separately.

Collaborator
Sorry, I didn't notice that this is for torch models only. Then it makes sense to keep these changes separate.

eaidova closed this Jun 19, 2024