An optimized quantization and inference library for running LLMs locally on modern consumer-class GPUs
Stars: 0
Rank: 13222826