Advanced quantization algorithm for LLMs and VLMs, with support for CPU, Intel GPU, CUDA, and HPU. Seamlessly integrated with Torchao, Transformers, and vLLM. Export your models effortlessly to autogptq, autoawq, gguf, and autoround formats with higher accuracy, even at extremely low bit precision.