Distributed training (multi-node) of a Transformer model - View it on GitHub
Star
58
Rank
410343