A Transformers-compatible library for applying compression algorithms to LLMs, optimized for deployment with vLLM.