Commit 75bf8a3

Update ReleaseNotes.md
1 parent 972acc9 commit 75bf8a3

1 file changed: +2 -1 lines changed

ReleaseNotes.md

@@ -10,14 +10,15 @@ Post-training Quantization:
 - ...
 - Features:
 - (TorchFX, Experimental) Preview support for the new `quantize_pt2e` API has been introduced, enabling quantization of `torch.fx.GraphModule` models with the `OpenVINOQuantizer` and the `X86InductorQuantizer` quantizers. `quantize_pt2e` API utilizes `MinMax` algorithm statistic collectors, as well as `SmoothQuant`, `BiasCorrection` and `FastBiasCorrection` Post-Training Quantization algorithms.
+- (TensorFlow) The `nncf.quantize()` method is now the recommended way for the quantization initialization for Quantization-Aware Training. Please refer to an [example](examples/quantization_aware_training/tensorflow/mobilenet_v2) for more details about how to use new approach.
 - Fixes:
 - ...
 - Improvements:
 - Significantly faster data-free weight compression for OpenVINO models: INT4 compression is now up to 10x faster, while INT8 compression is up to 3x faster. The larger the model the higher the time reduction.
 - AWQ weight compression is now up to 2x faster, improving overall runtime efficiency.
 - Peak memory usage during INT4 data-free weight compression in the OpenVINO backend is reduced up to 50% for certain models.
 - Deprecations/Removals:
-- ...
+- (TensorFlow) The `nncf.tensorflow.create_compressed_model()` method is now marked as deprecated. Please use the `nncf.quantize()` method for the quantization initialization.
 - Tutorials:
 - [Post-Training Optimization of GLM-Edge-V Model](https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/glm-edge-v/glm-edge-v.ipynb)
 - [Post-Training Optimization of OmniGen Model](https://github.com/openvinotoolkit/openvino_notebooks/blob/latest/notebooks/omnigen/omnigen.ipynb)
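
The two added TensorFlow lines describe a migration from `nncf.tensorflow.create_compressed_model()` to `nncf.quantize()` for quantization initialization. A minimal sketch of what that might look like is below. It assumes a `tf.keras.Model` and a calibration source that yields `(images, labels)` batches. The helper names `transform_fn` and `quantize_for_qat` are hypothetical and not part of NNCF. The example linked in the release note remains the authoritative reference.

```python
def transform_fn(data_item):
    """Return only the model inputs from an (images, labels) batch."""
    images, _labels = data_item
    return images


def quantize_for_qat(model, calibration_data):
    """Initialize quantization with nncf.quantize() before fine-tuning.

    Assumptions: `model` is a tf.keras.Model and `calibration_data` is an
    iterable (for example a tf.data.Dataset) yielding (images, labels) batches.
    """
    # Imported lazily so this module can be loaded without NNCF installed.
    import nncf

    # nncf.Dataset wraps the data source; transform_fn extracts model inputs.
    calibration_dataset = nncf.Dataset(calibration_data, transform_fn)

    # The returned model carries quantization operations and, per the release
    # note, serves as the starting point for Quantization-Aware Training.
    return nncf.quantize(model, calibration_dataset)
```

Under these assumptions, the returned model would then be compiled and fine-tuned with the usual Keras `compile()` and `fit()` calls.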
