intel/auto-round - Gitstar Ranking

intel

Fetched on 2026/05/31 12:29

A SOTA quantization algorithm for high-accuracy low-bit LLM inference, seamlessly optimized for CPU/XPU/CUDA, with multi-datatype support and full compatibility with vLLM, SGLang, and Transformers.

Star: 1424

Rank: 30867
