Official NVIDIA/srt-slurm sweep configs for benchmarking LLMs across NVIDIA GPUs and frameworks, spanning aggregated and disaggregated serving on single- and multi-node setups. - View it on GitHub
Star
2
Rank
4248636