Migrate RTN, HQQ and AWQ to Torch new 3.x API #1765

yuwenzho · 2024-04-29T06:04:39Z

Type of Change

feature
API changed or not: yes

Description

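Migrate the RTN, HQQ and AWQ weight-only quantization algorithms to the new Torch 3.x API. All three now go through the same prepare and convert entry points.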

RTN

from neural_compressor.torch.quantization import get_default_rtn_config, prepare, convert
quant_config = get_default_rtn_config()
model = prepare(model, quant_config)
q_model = convert(model)
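
For readers unfamiliar with RTN (round-to-nearest), the idea behind the algorithm can be sketched in plain Python. This is a conceptual illustration only, not the code in neural_compressor/torch/algorithms/weight_only/rtn.py: the helper names `rtn_quantize` and `rtn_dequantize`, the symmetric signed 4-bit scheme, and the single shared scale per group are all assumptions made for the example.

```python
from typing import List, Tuple


def rtn_quantize(weights: List[float], bits: int = 4) -> Tuple[List[int], float]:
    """Symmetric round-to-nearest quantization of one group of weights.

    Hypothetical helper for illustration; not part of neural_compressor.
    """
    qmax = 2 ** (bits - 1) - 1       # 7 for signed 4-bit
    qmin = -(2 ** (bits - 1))        # -8 for signed 4-bit
    max_abs = max(abs(w) for w in weights)
    # One scale per group; fall back to 1.0 for an all-zero group.
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round each scaled weight to the nearest integer and clamp to the range.
    q = [min(qmax, max(qmin, round(w / scale))) for w in weights]
    return q, scale


def rtn_dequantize(q: List[int], scale: float) -> List[float]:
    """Map integer codes back to approximate float weights."""
    return [v * scale for v in q]


weights = [0.7, -0.3, 0.12, 0.0]
q, scale = rtn_quantize(weights)
restored = rtn_dequantize(q, scale)
print(q, restored)
```

Because RTN needs no calibration data, the prepare and convert calls above take only the model and the config.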

HQQ

from neural_compressor.torch.quantization import get_default_hqq_config, prepare, convert
quant_config = get_default_hqq_config()
model = prepare(model, quant_config)
q_model = convert(model)

AWQ

from neural_compressor.torch.quantization import get_default_awq_config, prepare, convert
quant_config = get_default_awq_config()
# prepare
model = prepare(model, quant_config, example_inputs=example_inputs)
# calibrate
calib_func(model)
# convert
q_model = convert(model)
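
The AWQ snippet leaves `calib_func` undefined. A minimal sketch of what such a function might look like is given here, assuming it only needs to run a few representative batches through the prepared model. The factory `make_calib_func`, its `max_batches` parameter, and the `fake_model` stand-in are hypothetical names for this example and are not part of neural_compressor.

```python
from typing import Any, Callable, List, Sequence


def make_calib_func(
    batches: Sequence[Any], max_batches: int = 8
) -> Callable[[Callable[[Any], Any]], None]:
    """Build a calibration function that feeds a few batches to a model.

    Hypothetical helper: the source only shows that `calib_func(model)` is
    called between prepare and convert, not how it is implemented.
    """

    def calib_func(model: Callable[[Any], Any]) -> None:
        # Forward passes only; the outputs are discarded.
        # With a real PyTorch model one would typically also switch to
        # eval mode and disable gradient tracking around this loop.
        for batch in batches[:max_batches]:
            model(batch)

    return calib_func


# Stand-in for a prepared model so the sketch runs without PyTorch.
seen: List[Any] = []


def fake_model(batch: Any) -> Any:
    seen.append(batch)
    return batch


calib_func = make_calib_func([[1, 2], [3, 4], [5, 6]], max_batches=2)
calib_func(fake_model)
print(seen)
```

In a real run, `batches` would hold tokenized samples shaped like `example_inputs`, and the prepared model returned by `prepare` would take the place of `fake_model`.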

How has this PR been tested?

CI

Dependency Change?

no

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

github-actions · 2024-04-29T06:05:00Z

⚡ Required checks status: All passing 🟢

Groups summary

🟢 Code Scan Tests workflow

Check ID	Status
Code-Scan	success	✅
Code-Scan (Bandit Code Scan Bandit)	success	✅
Code-Scan (DocStyle Code Scan DocStyle)	success	✅
Code-Scan (Pylint Code Scan Pylint)	success	✅

These checks are required after the changes to neural_compressor/torch/algorithms/base_algorithm.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/weight_only/awq.py, neural_compressor/torch/algorithms/weight_only/hqq/__init__.py, neural_compressor/torch/algorithms/weight_only/hqq/quant_api.py, neural_compressor/torch/algorithms/weight_only/hqq/quantizer.py, neural_compressor/torch/algorithms/weight_only/rtn.py, neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py.

🟢 Model Tests 3x workflow

Check ID	Status
Model-Test-3x	success	✅
Model-Test-3x (Generate Report GenerateReport)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_bnb)	success	✅
Model-Test-3x (Run PyTorch Model opt_125m_woq_gptq_int4_dq_ggml)	success	✅

These checks are required after the changes to neural_compressor/torch/algorithms/base_algorithm.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/weight_only/awq.py, neural_compressor/torch/algorithms/weight_only/hqq/__init__.py, neural_compressor/torch/algorithms/weight_only/hqq/quant_api.py, neural_compressor/torch/algorithms/weight_only/hqq/quantizer.py, neural_compressor/torch/algorithms/weight_only/rtn.py, neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py.

🟢 Unit Tests 3x-PyTorch workflow

Check ID	Status
UT-3x-Torch	success	✅
UT-3x-Torch (Coverage Compare CollectDatafiles)	success	✅
UT-3x-Torch (Unit Test 3x Torch Unit Test 3x Torch)	success	✅
UT-3x-Torch (Unit Test 3x Torch baseline Unit Test 3x Torch baseline)	success	✅

These checks are required after the changes to neural_compressor/torch/algorithms/base_algorithm.py, neural_compressor/torch/algorithms/static_quant/static_quant.py, neural_compressor/torch/algorithms/weight_only/awq.py, neural_compressor/torch/algorithms/weight_only/hqq/__init__.py, neural_compressor/torch/algorithms/weight_only/hqq/quant_api.py, neural_compressor/torch/algorithms/weight_only/hqq/quantizer.py, neural_compressor/torch/algorithms/weight_only/rtn.py, neural_compressor/torch/algorithms/weight_only/utility.py, neural_compressor/torch/quantization/algorithm_entry.py, test/3x/torch/quantization/weight_only/hqq/test_hqq_cpu.py, test/3x/torch/quantization/weight_only/hqq/test_hqq_cuda.py, test/3x/torch/quantization/weight_only/test_awq.py, test/3x/torch/quantization/weight_only/test_rtn.py.

Thank you for your contribution! 💜

Note
This comment is automatically generated and will be updated every 180 seconds within the next 6 hours. If you have any other questions, contact chensuyue or XuehaoSun for help.


yiliu30

Others are LGTM.

test/3x/torch/quantization/weight_only/hqq/test_hqq_cuda.py


yuwenzho added 4 commits April 28, 2024 02:39

migrate AWQ to torch new 3.x API

50c30b2

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

Merge branch 'master' into yuwenzho/refactor_rtn_hqq_awq

9e5306d

migrate RTN & HQQ to torch new 3.x API

017b1d0

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

enhance docstring

bb7ea33

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

yuwenzho added INC3.X PyTorch Related to PyTorch F/W labels Apr 29, 2024

yuwenzho requested review from Kaihui-intel, yiliu30 and xin3he and removed request for Kaihui-intel and yiliu30 April 29, 2024 06:04

yuwenzho requested a review from yiliu30 April 29, 2024 06:05

yuwenzho and others added 2 commits April 28, 2024 23:05

Merge branch 'master' into yuwenzho/refactor_rtn_hqq_awq

04502ea

[pre-commit.ci] auto fixes from pre-commit.com hooks

76fda52

for more information, see https://pre-commit.ci

yiliu30 approved these changes Apr 29, 2024


test/3x/torch/quantization/weight_only/hqq/test_hqq_cuda.py (outdated, resolved)

Kaihui-intel approved these changes Apr 29, 2024


enhance base Quantizer

d209c9a

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

xin3he approved these changes Apr 30, 2024


yuwenzho and others added 5 commits April 30, 2024 01:09

Merge branch 'master' into yuwenzho/refactor_rtn_hqq_awq

1aacaa3

Merge branch 'master' into yuwenzho/refactor_rtn_hqq_awq

eebb5a6

Merge branch 'master' into yuwenzho/refactor_rtn_hqq_awq

9e48ac6

enhance code

c8f42e2

Signed-off-by: yuwenzho <yuwen.zhou@intel.com>

[pre-commit.ci] auto fixes from pre-commit.com hooks

2bb1fc6

for more information, see https://pre-commit.ci

yuwenzho merged commit 1a45090 into master May 7, 2024
30 checks passed

yuwenzho deleted the yuwenzho/refactor_rtn_hqq_awq branch May 7, 2024 08:18
