Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
ParisNeo
Fetched on 2024/05/01 03:54
ParisNeo
/
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. -
View it on GitHub
Star
1
Rank
4891825