Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
mazzzystar
Fetched on 2025/03/15 14:17
mazzzystar
/
TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles. -
View it on GitHub
https://arxiv.org/abs/2410.05262
Star
142
Rank
205747