ML-Dev-Bench is a benchmark for evaluating AI agents against various ML development tasks. - View it on GitHub
Star
1
Rank
6012284