Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
gmh5225
Fetched on 2026/05/08 11:55
gmh5225
/
turboquant-pytorch
From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity. -
View it on GitHub
Star
0
Rank
13993518