A single-binary, GPU-accelerated LLM server (HTTP and WebSocket API) written in Rust - View it on GitHub
Star
0
Rank
13844299