A lightweight tool for rule-based evaluation of LLMs.
Stars: 70
Rank: 368603