Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
gmh5225
Fetched on 2026/05/08 11:55
gmh5225
/
turboquant-kv
Open-source PyTorch implementation of Google TurboQuant (ICLR 2026) — extreme KV-cache quantization to ~3 bits with zero accuracy loss. 6x less memory, up to 8x faster inference. -
View it on GitHub
https://pypi.org/project/turboquant-kv
Star
0
Rank
13993518