Code for Adam-mini: Use Fewer Learning Rates To Gain More https://arxiv.org/abs/2406.16793 - View it on GitHub
Star
1
Rank
5982720