Enable optimized compression of models with fp8 weights #20

nikita-savelyevv · 2025-03-17T17:30:44Z

Changes

Reason for changes

Related tickets

Tests

nikita-savelyevv · 2025-03-17T17:32:19Z

nncf/openvino/optimized_functions/models.py

@@ -381,6 +387,7 @@ def _build_compress_model(
    # Build OV model
    weight = opset.parameter(weight_shape, name="weight", dtype=DTYPE_MAP_OV[weight_dtype])
    ov_parameters = [weight]
+    weight = convert_op(weight, ov.Type.f32)


Casting to FP32 in the beginning because FP8 data type does not support abs operation.

nikita-savelyevv · 2025-03-17T17:32:48Z

@alexsu52 please take a look

alexsu52 · 2025-03-18T07:50:27Z

@alexsu52 please take a look

Thanks for the PR. As far as I understand how to compress of fp8 data types is open question. Do you have any tickets where user scenario are described.

nikita-savelyevv · 2025-03-18T09:00:41Z

Thanks for the PR. As far as I understand how to compress of fp8 data types is open question. Do you have any tickets where user scenario are described.

Unfortunately not. Closing then.

Enable optimized compression of models with fp8 weights

db0a5d6

github-actions bot added the NNCF OpenVINO label Mar 17, 2025

nikita-savelyevv commented Mar 17, 2025

View reviewed changes

nikita-savelyevv mentioned this pull request Mar 17, 2025

FP8 types support in NNCF graph building openvinotoolkit/nncf#3344

Merged

nikita-savelyevv closed this Mar 18, 2025

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Enable optimized compression of models with fp8 weights #20

Enable optimized compression of models with fp8 weights #20

nikita-savelyevv commented Mar 17, 2025

nikita-savelyevv Mar 17, 2025

nikita-savelyevv commented Mar 17, 2025

alexsu52 commented Mar 18, 2025

nikita-savelyevv commented Mar 18, 2025

Enable optimized compression of models with fp8 weights #20

Enable optimized compression of models with fp8 weights #20

Conversation

nikita-savelyevv commented Mar 17, 2025

Changes

Reason for changes

Related tickets

Tests

nikita-savelyevv Mar 17, 2025

Choose a reason for hiding this comment

nikita-savelyevv commented Mar 17, 2025

alexsu52 commented Mar 18, 2025

nikita-savelyevv commented Mar 18, 2025