some common Huggingface transformers in maximal update parametrization (µP) - View it on GitHub
Star
86
Rank
314250