Gitstar Ranking
Fetched on 2025/12/26 09:39
intel/neural-compressor
SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime
https://intel.github.io/neural-compressor/
Star: 2557
Rank: 15295