A benchmark dataset for evaluating dialog system and natural language generation metrics. - View it on GitHub
Star
38
Rank
571545