-
Notifications
You must be signed in to change notification settings - Fork 263
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Gptq refactor #1770
Gptq refactor #1770
Conversation
⚡ Required checks status: All passing 🟢Groups summary🟢 Code Scan Tests workflow
These checks are required after the changes to 🟢 Model Tests 3x workflow
These checks are required after the changes to 🟢 Unit Tests 3x-PyTorch workflow
These checks are required after the changes to Thank you for your contribution! 💜
|
One more thing, suggest merging it into master directly. |
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Signed-off-by: xin3he <xin3.he@intel.com>
Type of Change
feature
API changed or not: yes
Description
Reduce additional initialization of quantizer between prepare and convert


Before:
Now:
Reduce additional initialization of quantizer between prepare and convert
Migrate GPTQ to Torch new 3x API
GPTQ
How has this PR been tested?
Pre-ci