Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ONNX]: Add support for data-free Weight Compression Algorithm (#3273) #3346

Open
wants to merge 1 commit into
base: develop
Choose a base branch
from

Conversation

XueSongTap
Copy link

Changes

This PR adds ONNX backend support for the data-free Weight Compression Algorithm.
The implementation:

  • Creates a new WeightCompressionAlgoBackend for ONNX
  • Supports int8 and uint8 weight compression

Reason for changes

This implementation expands NNCF's weight compression support to the ONNX backend,
complementing existing support for OpenVINO, Torch, and TorchFX backends.
This allows users to directly compress ONNX models without conversion to other formats.

Related tickets

Fixes #3273

Tests

@XueSongTap XueSongTap requested a review from a team as a code owner March 16, 2025 08:53
@github-actions github-actions bot added NNCF ONNX Pull requests that updates NNCF ONNX NNCF PTQ Pull requests that updates NNCF PTQ labels Mar 16, 2025
@alexsu52 alexsu52 requested a review from kshpv March 17, 2025 05:28
@alexsu52 alexsu52 self-requested a review March 17, 2025 05:29
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
NNCF ONNX Pull requests that updates NNCF ONNX NNCF PTQ Pull requests that updates NNCF PTQ
Projects
None yet
Development

Successfully merging this pull request may close these issues.

[Good First Issue][NNCF]: Add ONNX support of data-free Weight Compression Algorithm
2 participants