Popular repositories Loading
-
models
models PublicForked from onnx/models
A collection of pre-trained, state-of-the-art models in the ONNX format
Jupyter Notebook
-
onnxruntime
onnxruntime PublicForked from microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
C++
-
onnxruntime-inference-examples
onnxruntime-inference-examples PublicForked from microsoft/onnxruntime-inference-examples
Examples for using ONNX Runtime for machine learning inferencing.
Python
-
optimum-intel
optimum-intel PublicForked from huggingface/optimum-intel
🤗 Optimum Intel: Accelerate inference with Intel optimization tools
Python
-
-
auto-round
auto-round PublicForked from intel/auto-round
SOTA Weight-only Quantization Algorithm for LLMs. This is official implementation of "Optimize Weight Rounding via Signed Gradient Descent for the Quantization of LLMs"
Python
87 contributions in the last year
Day of Week | March Mar | April Apr | May May | June Jun | July Jul | August Aug | September Sep | October Oct | November Nov | December Dec | January Jan | February Feb | March Mar | ||||||||||||||||||||||||||||||||||||||||
Sunday Sun | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Monday Mon | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Tuesday Tue | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Wednesday Wed | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Thursday Thu | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Friday Fri | |||||||||||||||||||||||||||||||||||||||||||||||||||||
Saturday Sat |
Contribution activity
March 2025
Created 1 commit in 1 repository
Created 1 repository
-
mengniwang95/vllm-fork
Python
•
Built by
This contribution was made on Mar 24
Created a pull request in intel/neural-compressor that received 1 comment
Opened 2 other pull requests in 2 repositories
yiliu30/vllm-fork
1
merged
-
enable layer-by-layer
This contribution was made on Mar 3
intel/neural-compressor
1
merged
-
Enable layer-by-layer convert for vllm Deepseek model
This contribution was made on Mar 3
Reviewed 1 pull request in 1 repository
intel/neural-compressor
1 pull request
-
Add transformers to align onnxruntime-extensions=1.14.0
This contribution was made on Mar 20