Support AWQ models #1049
Merged
34 commits
64f64b0 Support AWQ models (mvafin)
86d9328 Add tests (mvafin)
decbcc2 Add dependencies (mvafin)
9fb1da4 Fix tests (mvafin)
04d0cf9 enable awq export only if ov support it (eaidova)
b51cdee Merge pull request #1 from eaidova/ea/awq_fix (eaidova)
df97004 fix style (#2) (eaidova)
cf2fc8b disable awq and gptq install for old torch (#3) (eaidova)
ae8c7db Merge branch 'main' into mvafin/support_awq (eaidova)
f0f7a72 separate common quant models patching and gptq (#4) (eaidova)
ab6ac99 disable windows install (#5) (eaidova)
ff66f43 skip logits check for quantized models (#6) (eaidova)
3b73f17 Merge branch 'main' into mvafin/support_awq (eaidova)
e8be988 fix test after rebase (eaidova)
5d8bcb7 fix testing condition for 2024.6 and unpatch in case if failed (eaidova)
cf3aad4 Fix qwen2-vl tests (#1084) (nikita-savelyevv)
106a5b7 Skip private mdoel loading test for external contributors (#1082) (echarlaix)
cda4908 Fix reshaping unet if timestep is 0d tensor (#1083) (eaidova)
8ef3997 Disable kv cache compression for fp vlm (#1080) (eaidova)
8e5573f Support AWQ models (mvafin)
0d7f4bf Add tests (mvafin)
ae544af Add dependencies (mvafin)
013081c Fix tests (mvafin)
b7cd49d enable awq export only if ov support it (eaidova)
da3bd88 fix style (#2) (eaidova)
0a0c7aa disable awq and gptq install for old torch (#3) (eaidova)
55dad0c separate common quant models patching and gptq (#4) (eaidova)
c05aaf0 disable windows install (#5) (eaidova)
40cd57f skip logits check for quantized models (#6) (eaidova)
9ddc5a8 fix test after rebase (eaidova)
a241a7d fix testing condition for 2024.6 and unpatch in case if failed (eaidova)
b0e4860 add necessary packages in test_openvino_full (eaidova)
630d36a Merge branch 'mvafin/support_awq' of https://github.com/mvafin/optimu… (eaidova)
7607f45 fix code style after rebase (#7) (eaidova)
These are not valid extra URLs for auto-gptq and awq.
This is needed to prevent torch from being reinstalled with CUDA while the third-party packages are installed. The packages themselves should be installed from the regular source; only the torch-dependent libraries will be tried from the torch CPU URL. The difference between --index-url and --extra-index-url is that the first replaces the source index completely, while the second adds the given index URL as an additional source, used if the library is present there.
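To make the distinction concrete, here is a minimal sketch of how the two pip options shape an install command. The helper name `build_pip_command`, the package names, and the use of the PyTorch CPU wheel index URL are assumptions for illustration; they are not taken from this PR's diff.

```python
import sys

# Assumption: the public PyTorch CPU wheel index; the PR's actual URL is not shown here.
TORCH_CPU_INDEX = "https://download.pytorch.org/whl/cpu"


def build_pip_command(packages, index_url=None, extra_index_url=None):
    """Build (but do not run) a `pip install` command line.

    index_url       -> passed as --index-url: replaces the default index entirely.
    extra_index_url -> passed as --extra-index-url: searched in addition to the
                       default index.
    """
    cmd = [sys.executable, "-m", "pip", "install"]
    if index_url is not None:
        cmd += ["--index-url", index_url]
    if extra_index_url is not None:
        cmd += ["--extra-index-url", extra_index_url]
    cmd += list(packages)
    return cmd


# Hypothetical package list: the quantization packages come from the default
# index, while torch can be found on the CPU index, avoiding a CUDA build.
cmd = build_pip_command(["auto-gptq", "autoawq"], extra_index_url=TORCH_CPU_INDEX)
print(" ".join(cmd[1:]))
```

With `--index-url` instead, packages that are not hosted on the CPU wheel index would fail to resolve, which is why the additive option is the one that fits the behavior described above.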