MII makes low-latency and high-throughput inference possible, powered by DeepSpeed. - View it on GitHub
Star
1890
Rank
18234