some common Huggingface transformers in maximal update parametrization (µP) - View it on GitHub
Star
77
Rank
304321