A Benchmark for Evaluating Multi-Hop, Multi-Source Tool-Calling in AI Agents
Stars: 61
Rank: 448188