Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". - View it on GitHub
Star
570
Rank
59335