Gitstar Ranking
Users
Organizations
Repositories
liunix61
Fetched on 2026/05/08 12:14
liunix61/turboquant-pytorch
From-scratch PyTorch implementation of Google's TurboQuant (ICLR 2026) for LLM KV cache compression. 5x compression at 3-bit with 99.5% attention fidelity.
Stars: 1
Rank: 6063613