slime is an LLM post-training framework aimed at scaling RL.
Stars: 0
Rank: 13250909