Aanuf/data free awq #3315

andreyanufr · 2025-02-26T19:44:30Z

Changes

Data free AWQ: smooth down_proj input channels and merge extra scale to up_proj output channels.

Reason for changes

ljaljushkin · 2025-03-14T11:35:26Z

nncf/quantization/algorithms/weight_compression/awq.py

+            w_scale = fns.unsqueeze(scale, 1 - wp.reduction_axes[0])
+            a_scale = fns.unsqueeze(1.0 / scale, wp.reduction_axes[0])
+
+            scaled_weight = weight * w_scale


Please rebase from develop, I've made some casting for float16 models

ljaljushkin · 2025-03-14T11:37:07Z

nncf/quantization/algorithms/weight_compression/awq.py

+        model_transformer = ModelTransformerFactory.create(model, inplace=True)
+
+        is_data_free = True #statistics is None
+        description = "Applying data-free AWQ" if is_data_free else "Applying AWQ"


Suggested change

description = "Applying data-free AWQ" if is_data_free else "Applying AWQ"

description = "Applying data-free AWQ" if is_data_free else "Applying data-aware AWQ"

maybe more clear

ljaljushkin · 2025-03-14T11:40:41Z

nncf/quantization/algorithms/weight_compression/algorithm.py

@@ -506,7 +506,7 @@ def apply(
        nodes_to_compress = self.get_nodes_to_compress(graph)

        statistics = None
-        if self._data_aware_mixed_precision or self._data_aware_compression:
+        if (self._data_aware_mixed_precision or self._data_aware_compression) and dataset:


I'd redefine

self._data_aware_compression = (self._awq and dataset) or (...)

alexsu52 and others added 16 commits September 2, 2024 13:22

Support scale estimation inside GPTQ

488cacc

fix for INT4_ASYM

ee64877

Merge remote-tracking branch 'upstream/develop' into develop

f22e411

Merge remote-tracking branch 'upstream/develop' into develop

51b4d7b

Merge remote-tracking branch 'upstream/develop' into develop

f66cd1e

Merge remote-tracking branch 'upstream/develop' into develop

7ce5a53

Merge remote-tracking branch 'upstream/develop' into develop

f74d156

Merge remote-tracking branch 'upstream/develop' into develop

5288c79

Merge remote-tracking branch 'upstream/develop' into develop

1becf15

Merge remote-tracking branch 'upstream/develop' into develop

047d7d9

Merge remote-tracking branch 'upstream/develop' into develop

c0c7e57

Merge remote-tracking branch 'upstream/develop' into develop

b74dea1

Merge remote-tracking branch 'upstream/develop' into develop

26a9a77

Merge remote-tracking branch 'upstream/develop' into develop

25fcc2c

Data-free AWQ prototype.

f6f4693

Data free AWQ.

19a64ac

github-actions bot added the NNCF PTQ Pull requests that updates NNCF PTQ label Feb 26, 2025

Fixed style.

bf215d5

github-actions bot added the NNCF OpenVINO Pull requests that updates NNCF OpenVINO label Feb 27, 2025

andreyanufr added 2 commits February 27, 2025 10:18

Fixed shape of data item int test.

566ebe7

Fixed test case for E2M1.

70e47c8

github-actions bot added the API Public API-impacting changes label Mar 4, 2025

Enable awq by default.

6b3310b

andreyanufr requested a review from ljaljushkin March 13, 2025 08:16

ljaljushkin reviewed Mar 14, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Aanuf/data free awq #3315

Aanuf/data free awq #3315

andreyanufr commented Feb 26, 2025

ljaljushkin Mar 14, 2025

ljaljushkin Mar 14, 2025

ljaljushkin Mar 14, 2025

	description = "Applying data-free AWQ" if is_data_free else "Applying AWQ"
	description = "Applying data-free AWQ" if is_data_free else "Applying data-aware AWQ"

Aanuf/data free awq #3315

Are you sure you want to change the base?

Aanuf/data free awq #3315

Conversation

andreyanufr commented Feb 26, 2025

Changes

Reason for changes

ljaljushkin Mar 14, 2025

Choose a reason for hiding this comment

ljaljushkin Mar 14, 2025

Choose a reason for hiding this comment

ljaljushkin Mar 14, 2025

Choose a reason for hiding this comment