Some common Hugging Face transformers in maximal update parametrization (µP)
Stars: 87
Rank: 316078