Skip to content

SOTA low-bit LLM quantization (INT8/FP8/INT4/FP4/NF4) & sparsity; leading model compression techniques on TensorFlow, PyTorch, and ONNX Runtime

License

Notifications You must be signed in to change notification settings

intel/neural-compressor

Folders and files

NameName
Last commit message
Last commit date

Latest commit

48d6329 · Jan 17, 2025
Jan 6, 2023
Jan 11, 2023
Jul 17, 2023
Jul 17, 2023
Oct 17, 2023
Aug 13, 2024
Aug 13, 2024
Aug 13, 2024
Aug 13, 2024
Jan 17, 2025
Dec 12, 2022
Dec 27, 2022
Dec 27, 2022
Aug 13, 2024

Repository files navigation

Security Policy

Intel is committed to rapidly addressing security vulnerabilities affecting our customers and providing clear guidance on the solution, impact, severity and mitigation.

Reporting a Vulnerability

Please report any security vulnerabilities in this project utilizing the guidelines here.