Model parallel transformers in JAX and Haiku - View it on GitHub
Star
6364
Rank
5819