Gitstar Ranking
Fetched on 2025/03/15 12:23
ParisNeo/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
Stars: 2
Rank: 3685759