A benchmark dataset for evaluating dialog system and natural language generation metrics. - View it on GitHub
Star
36
Rank
582689