A collection of benchmarks and datasets for evaluating LLM. - View it on GitHub
Star
1
Rank
5980881