benchmarking large language models (LLMs) with a focus on their mathematical capabilities - View it on GitHub
Star
2
Rank
4216806