A benchmark dataset for evaluating dialog system and natural language generation metrics. - View it on GitHub
Star
35
Rank
546222