LLaMa 7b with CUDA acceleration implemented in rust. Minimal GPU memory needed! - View it on GitHub
Star
111
Rank
280353