Production-ready LLM compression/quantization toolkit with accelerated inference support for both CPU and GPU via HF, vLLM, and SGLang.