Swarm training framework using Haiku + JAX + Ray for layer parallel transformer language models on unreliable, heterogeneous nodes - View it on GitHub
Star
229
Rank
123054