A fast and user-friendly runtime for transformer inference (BERT, ALBERT, GPT2, decoders, etc.) on CPU and GPU.
Stars: 1420
Rank: 22186