Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks - View it on GitHub
Star
175
Rank
174638