This is a replicate of DeepSeek-R1-Zero and DeepSeek-R1 training on small models with limited data - View it on GitHub
Star
0
Rank
13818244