slime is an LLM post-training framework for RL Scaling.(Fork for contributing. All changes intended for upstream PRs.) - View it on GitHub
Star
3
Rank
3375095