gmh5225/llm-eval-analysis - Gitstar Ranking

gmh5225

Fetched on 2026/05/08 11:55

Automatic multi-metric evaluation of human-bot dialogues using LLMs (Claude, GPT-4o) across different datasets and settings. Built for the Artificial Intelligence course at the University of Salerno.

Star

Rank

13993518
