CMU DoG

CMU Document Grounded Conversations Dataset

Texts

This is a document grounded dataset for text conversations. "Document Grounded Conversations" are conversations that are about the contents of a specified document. In this dataset the specified documents are Wikipedia articles about popular movies. The dataset contains 4112 conversations with an average of 21.43 turns per conversation.

Source: https://github.com/festvox/datasets-CMU_DoG Image Source: https://arxiv.org/pdf/1809.07358v1.pdf