Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
gmh5225
Fetched on 2024/05/01 04:07
gmh5225
/
exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights. -
View it on GitHub
Star
0
Rank
11564054