intel/neural-compressor - Gitstar Ranking

intel

Fetched on 2026/05/31 12:29

SOTA low-bit LLM quantization (INT8/FP8/MXFP8/INT4/MXFP4/NVFP4) & sparsity; leading model compression techniques on PyTorch, TensorFlow, and ONNX Runtime - View it on GitHub

https://intel.github.io/neural-compressor/

Star

2648

Rank

16168

intel

intel / neural-compressor