VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications - View it on GitHub
Star
20
Rank
980920