Simple framework for training and evaluating math reasoning agents using local models, GRPO and vLLM. - View it on GitHub
Star
6
Rank
2075137