Model parallel transformers in JAX and Haiku
Stars: 6221
Rank: 4782