TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Gener...

OpenViDial 2.0: A Larger-Scale, Open-Domain Dialogue Generation Dataset with Visual Contexts

Shuhe Wang, Yuxian Meng, Xiaoya Li, Xiaofei Sun, Rongbin Ouyang, Jiwei Li

2021-09-27Multi-modal Dialogue GenerationDialogue Generation
PaperPDFCode(official)

Abstract

In order to better simulate the real human conversation process, models need to generate dialogue utterances based on not only preceding textual contexts but also visual contexts. However, with the development of multi-modal dialogue learning, the dataset scale gradually becomes a bottleneck. In this report, we release OpenViDial 2.0, a larger-scale open-domain multi-modal dialogue dataset compared to the previous version OpenViDial 1.0. OpenViDial 2.0 contains a total number of 5.6 million dialogue turns extracted from either movies or TV series from different resources, and each dialogue turn is paired with its corresponding visual context. We hope this large-scale dataset can help facilitate future researches on open-domain multi-modal dialog generation, e.g., multi-modal pretraining for dialogue generation.

Results

TaskDatasetMetricValueModel
DialogueOpenViDial 2.0BLEU1.99FV (w/o MI)
DialogueOpenViDial 2.0Dis-10.0056FV (w/o MI)
DialogueOpenViDial 2.0Dis-20.0431FV (w/o MI)
DialogueOpenViDial 2.0Dis-30.125FV (w/o MI)
DialogueOpenViDial 2.0Dis-40.2215FV (w/o MI)
DialogueOpenViDial 2.0BLEU1.97CV (w/o MI)
DialogueOpenViDial 2.0Dis-10.0041CV (w/o MI)
DialogueOpenViDial 2.0Dis-20.0353CV (w/o MI)
DialogueOpenViDial 2.0Dis-30.0999CV (w/o MI)
DialogueOpenViDial 2.0Dis-40.1726CV (w/o MI)
DialogueOpenViDial 2.0BLEU1.96NV (w/ MI)
DialogueOpenViDial 2.0Dis-10.0039NV (w/ MI)
DialogueOpenViDial 2.0Dis-20.0311NV (w/ MI)
DialogueOpenViDial 2.0Dis-30.0953NV (w/ MI)
DialogueOpenViDial 2.0Dis-40.163NV (w/ MI)
DialogueOpenViDial 2.0BLEU1.95NV (w/o MI)
DialogueOpenViDial 2.0Dis-10.0037NV (w/o MI)
DialogueOpenViDial 2.0Dis-20.0302NV (w/o MI)
DialogueOpenViDial 2.0Dis-30.0929NV (w/o MI)
DialogueOpenViDial 2.0Dis-40.1711NV (w/o MI)
Text GenerationOpenViDial 2.0BLEU1.99FV (w/o MI)
Text GenerationOpenViDial 2.0Dis-10.0056FV (w/o MI)
Text GenerationOpenViDial 2.0Dis-20.0431FV (w/o MI)
Text GenerationOpenViDial 2.0Dis-30.125FV (w/o MI)
Text GenerationOpenViDial 2.0Dis-40.2215FV (w/o MI)
Text GenerationOpenViDial 2.0BLEU1.97CV (w/o MI)
Text GenerationOpenViDial 2.0Dis-10.0041CV (w/o MI)
Text GenerationOpenViDial 2.0Dis-20.0353CV (w/o MI)
Text GenerationOpenViDial 2.0Dis-30.0999CV (w/o MI)
Text GenerationOpenViDial 2.0Dis-40.1726CV (w/o MI)
Text GenerationOpenViDial 2.0BLEU1.96NV (w/ MI)
Text GenerationOpenViDial 2.0Dis-10.0039NV (w/ MI)
Text GenerationOpenViDial 2.0Dis-20.0311NV (w/ MI)
Text GenerationOpenViDial 2.0Dis-30.0953NV (w/ MI)
Text GenerationOpenViDial 2.0Dis-40.163NV (w/ MI)
Text GenerationOpenViDial 2.0BLEU1.95NV (w/o MI)
Text GenerationOpenViDial 2.0Dis-10.0037NV (w/o MI)
Text GenerationOpenViDial 2.0Dis-20.0302NV (w/o MI)
Text GenerationOpenViDial 2.0Dis-30.0929NV (w/o MI)
Text GenerationOpenViDial 2.0Dis-40.1711NV (w/o MI)
ChatbotOpenViDial 2.0BLEU1.99FV (w/o MI)
ChatbotOpenViDial 2.0Dis-10.0056FV (w/o MI)
ChatbotOpenViDial 2.0Dis-20.0431FV (w/o MI)
ChatbotOpenViDial 2.0Dis-30.125FV (w/o MI)
ChatbotOpenViDial 2.0Dis-40.2215FV (w/o MI)
ChatbotOpenViDial 2.0BLEU1.97CV (w/o MI)
ChatbotOpenViDial 2.0Dis-10.0041CV (w/o MI)
ChatbotOpenViDial 2.0Dis-20.0353CV (w/o MI)
ChatbotOpenViDial 2.0Dis-30.0999CV (w/o MI)
ChatbotOpenViDial 2.0Dis-40.1726CV (w/o MI)
ChatbotOpenViDial 2.0BLEU1.96NV (w/ MI)
ChatbotOpenViDial 2.0Dis-10.0039NV (w/ MI)
ChatbotOpenViDial 2.0Dis-20.0311NV (w/ MI)
ChatbotOpenViDial 2.0Dis-30.0953NV (w/ MI)
ChatbotOpenViDial 2.0Dis-40.163NV (w/ MI)
ChatbotOpenViDial 2.0BLEU1.95NV (w/o MI)
ChatbotOpenViDial 2.0Dis-10.0037NV (w/o MI)
ChatbotOpenViDial 2.0Dis-20.0302NV (w/o MI)
ChatbotOpenViDial 2.0Dis-30.0929NV (w/o MI)
ChatbotOpenViDial 2.0Dis-40.1711NV (w/o MI)
Dialogue GenerationOpenViDial 2.0BLEU1.99FV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-10.0056FV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-20.0431FV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-30.125FV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-40.2215FV (w/o MI)
Dialogue GenerationOpenViDial 2.0BLEU1.97CV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-10.0041CV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-20.0353CV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-30.0999CV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-40.1726CV (w/o MI)
Dialogue GenerationOpenViDial 2.0BLEU1.96NV (w/ MI)
Dialogue GenerationOpenViDial 2.0Dis-10.0039NV (w/ MI)
Dialogue GenerationOpenViDial 2.0Dis-20.0311NV (w/ MI)
Dialogue GenerationOpenViDial 2.0Dis-30.0953NV (w/ MI)
Dialogue GenerationOpenViDial 2.0Dis-40.163NV (w/ MI)
Dialogue GenerationOpenViDial 2.0BLEU1.95NV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-10.0037NV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-20.0302NV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-30.0929NV (w/o MI)
Dialogue GenerationOpenViDial 2.0Dis-40.1711NV (w/o MI)

Related Papers

Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching2025-07-12SDialog: A Python Toolkit for Synthetic Dialogue Generation and Analysis2025-06-12Enhancing Medical Dialogue Generation through Knowledge Refinement and Dynamic Prompt Adjustment2025-06-12Proactive Assistant Dialogue Generation from Streaming Egocentric Videos2025-06-06ConsistentChat: Building Skeleton-Guided Consistent Dialogues for Large Language Models from Scratch2025-06-04CoVoMix2: Advancing Zero-Shot Dialogue Generation with Fully Non-Autoregressive Flow Matching2025-06-01Adaptive-VP: A Framework for LLM-Based Virtual Patients that Adapts to Trainees' Dialogue to Facilitate Nurse Communication Training2025-05-31