TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Simple Language Model for Task-Oriented Dialogue

A Simple Language Model for Task-Oriented Dialogue

Ehsan Hosseini-Asl, Bryan McCann, Chien-Sheng Wu, Semih Yavuz, Richard Socher

2020-05-02NeurIPS 2020 12Multi-domain Dialogue State TrackingDialogue State TrackingTransfer LearningLanguage ModellingEnd-To-End Dialogue ModellingResponse Generation
PaperPDFCode(official)

Abstract

Task-oriented dialogue is often decomposed into three tasks: understanding user input, deciding actions, and generating a response. While such decomposition might suggest a dedicated model for each sub-task, we find a simple, unified approach leads to state-of-the-art performance on the MultiWOZ dataset. SimpleTOD is a simple approach to task-oriented dialogue that uses a single, causal language model trained on all sub-tasks recast as a single sequence prediction problem. This allows SimpleTOD to fully leverage transfer learning from pre-trained, open domain, causal language models such as GPT-2. SimpleTOD improves over the prior state-of-the-art in joint goal accuracy for dialogue state tracking, and our analysis reveals robustness to noisy annotations in this setting. SimpleTOD also improves the main metrics used to evaluate action decisions and response generation in an end-to-end setting: inform rate by 8.1 points, success rate by 9.7 points, and combined score by 7.2 points.

Results

TaskDatasetMetricValueModel
DialogueMULTIWOZ 2.0BLEU15SimpleTOD
DialogueMULTIWOZ 2.0MultiWOZ (Inform)84.4SimpleTOD
DialogueMULTIWOZ 2.0MultiWOZ (Success)70.1SimpleTOD
DialogueMULTIWOZ 2.1BLEU15.2SimpleTOD
DialogueMULTIWOZ 2.1MultiWOZ (Inform)85SimpleTOD
DialogueMULTIWOZ 2.1MultiWOZ (Success)70.5SimpleTOD
Response GenerationMMConvBLEU20.3SimpleTOD
Response GenerationMMConvComb.32.2SimpleTOD
Response GenerationMMConvInform14.6SimpleTOD
Response GenerationMMConvSuccess9.2SimpleTOD

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction2025-07-18Disentangling coincident cell events using deep transfer learning and compressive sensing2025-07-17Making Language Model a Hierarchical Classifier and Generator2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17The Generative Energy Arena (GEA): Incorporating Energy Awareness in Large Language Model (LLM) Human Evaluations2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Best Practices for Large-Scale, Pixel-Wise Crop Mapping and Transfer Learning Workflows2025-07-16