Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks - View it on GitHub
Star
231
Rank
141271