Model parallel transformers in JAX and Haiku - View it on GitHub
Star
6319
Rank
4869