Muon optimizer: +>30% sample efficiency with <3% wallclock overhead
Stars: 504
Rank: 70732