gmh5225/critic-rubrics - Gitstar Ranking

gmh5225

Fetched on 2026/03/14 09:26

Official repo for paper "A Rubric-Supervised Critic from Sparse Real-World Outcomes". Type-safe function-calling-based LLM-as-judge evaluation framework for agent behavior prediction and analysis. - View it on GitHub

https://arxiv.org/abs/2603.03800

Star

Rank

13887920

gmh5225

gmh5225 / critic-rubrics