Distributed training (multi-node) of a Transformer model