A fast, local, and secure approach for training LLMs for coding tasks using GRPO with WebAssembly and interpreter feedback. - View it on GitHub
Star
0
Rank
13708167