MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering - View it on GitHub
Star
1211
Rank
32937