Code for "Accelerating Transformer Pre-training with 2:4 Sparsity" (available on GitHub)
Stars: 17
Rank: 939085