Rethinking On-Policy Distillation of Large Language Models: Phenomenology, Mechanism, and Recipe
GitHub stars: 534
Rank: 76246