A benchmark dataset for evaluating dialog system and natural language generation metrics.