Gitstar Ranking
Users
Organizations
Repositories
Rankings
Users
Organizations
Repositories
Sign in with GitHub
philschmid
Fetched on 2026/03/14 10:05
philschmid
/
ai-agent-benchmark-compendium
Compendium of over 50 benchmarks for evaluating AI agents, categorized into Function Calling & Tool Use, General Assistant & Reasoning, Coding & Software Engineering, and Computer Interaction. -
View it on GitHub
Star
111
Rank
279023