MARSHAL: Incentivizing Multi-Agent Reasoning via Self-Play with Strategic LLMs - View it on GitHub
Star
38
Rank
628719