Code for "Accelerating Transformer Pre-training with 2:4 Sparsity" - View it on GitHub
Star
10
Rank
1287307