A framework for generating realistic LLM serving workloads
Stars: 67
Rank: 379048