audiolabs/human-evaluation-of-llm

audiolabs

Fetched on 2026/03/03 08:51

This repository provides the code and data for the paper "Which Method(s) to Pick when Evaluating Large Language Models with Humans? -- A comparison of 6 methods."

Rank: 13988452
