A Hard Multi-Turn Hallucination Benchmark