some common Huggingface transformers in maximal update parametrization (µP) - View it on GitHub
Star
82
Rank
320703