TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/GALAXY: A Generative Pre-trained Model for Task-Oriented D...

GALAXY: A Generative Pre-trained Model for Task-Oriented Dialog with Semi-Supervised Learning and Explicit Policy Injection

Wanwei He, Yinpei Dai, Yinhe Zheng, Yuchuan Wu, Zheng Cao, Dermot Liu, Peng Jiang, Min Yang, Fei Huang, Luo Si, Jian Sun, Yongbin Li

2021-11-29End-To-End Dialogue Modelling
PaperPDFCode(official)

Abstract

Pre-trained models have proved to be powerful in enhancing task-oriented dialog systems. However, current pre-training methods mainly focus on enhancing dialog understanding and generation tasks while neglecting the exploitation of dialog policy. In this paper, we propose GALAXY, a novel pre-trained dialog model that explicitly learns dialog policy from limited labeled dialogs and large-scale unlabeled dialog corpora via semi-supervised learning. Specifically, we introduce a dialog act prediction task for policy optimization during pre-training and employ a consistency regularization term to refine the learned representation with the help of unlabeled dialogs. We also implement a gating mechanism to weigh suitable unlabeled dialog samples. Empirical results show that GALAXY substantially improves the performance of task-oriented dialog systems, and achieves new state-of-the-art results on benchmark datasets: In-Car, MultiWOZ2.0 and MultiWOZ2.1, improving their end-to-end combined scores by 2.5, 5.3 and 5.5 points, respectively. We also show that GALAXY has a stronger few-shot ability than existing models under various low-resource settings.

Results

TaskDatasetMetricValueModel
DialogueMULTIWOZ 2.0BLEU20.5GALAXY
DialogueMULTIWOZ 2.0MultiWOZ (Inform)94.4GALAXY
DialogueMULTIWOZ 2.0MultiWOZ (Success)85.3GALAXY
DialogueMULTIWOZ 2.1BLEU20.01GALAXY
DialogueMULTIWOZ 2.1MultiWOZ (Inform)95.3GALAXY
DialogueMULTIWOZ 2.1MultiWOZ (Success)86.2GALAXY

Related Papers

Cross-Lingual Dialogue Dataset Creation via Outline-Based Generation2022-01-31Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System2021-11-16Multi-Task Pre-Training for Plug-and-Play Task-Oriented Dialogue System2021-09-29Maintaining Common Ground in Dynamic Environments2021-05-29Pretraining the Noisy Channel Model for Task-Oriented Dialogue2021-03-18AuGPT: Auxiliary Tasks and Data Augmentation for End-To-End Dialogue with Pre-Trained Language Models2021-02-09A Probabilistic End-To-End Task-Oriented Dialog Model with Latent Belief States towards Semi-Supervised Learning2020-09-17SOLOIST: Building Task Bots at Scale with Transfer Learning and Machine Teaching2020-05-11