MII makes low-latency and high-throughput inference possible, powered by DeepSpeed.