Benchmark and research code for the paper SWEET-RL Training Multi-Turn LLM Agents onCollaborative Reasoning Tasks - View it on GitHub
Star
235
Rank
140869