Procedural Knowledge at Scale Improves ReasoningThis repository contains the minimal, end-to-end pipeline for reproducing the paper results generate a procedural knowledge datastore, build retrieval indices, run retrieval, perform model rollouts with retrieved subroutines, and filter the samples to output the final metrics. -
View it on GitHub