Benchmarking long-form factuality in large language models. Original code for our paper "Long-form factuality in large language models". - View it on GitHub
Star
593
Rank
61280