Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
mazzzystar
Fetched on 2026/01/31 17:14
mazzzystar
/
TurtleBench
TurtleBench: Evaluating Top Language Models via Real-World Yes/No Puzzles. -
View it on GitHub
https://arxiv.org/abs/2410.05262
Star
163
Rank
197041