TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Multiresolution Recurrent Neural Networks: An Application ...

Multiresolution Recurrent Neural Networks: An Application to Dialogue Response Generation

Iulian Vlad Serban, Tim Klinger, Gerald Tesauro, Kartik Talamadupula, Bo-Wen Zhou, Yoshua Bengio, Aaron Courville

2016-06-02Text GenerationDialogue GenerationResponse Generation
PaperPDFCodeCodeCodeCode(official)

Abstract

We introduce the multiresolution recurrent neural network, which extends the sequence-to-sequence framework to model natural language generation as two parallel discrete stochastic processes: a sequence of high-level coarse tokens, and a sequence of natural language tokens. There are many ways to estimate or learn the high-level coarse tokens, but we argue that a simple extraction procedure is sufficient to capture a wealth of high-level discourse semantics. Such procedure allows training the multiresolution recurrent neural network by maximizing the exact joint log-likelihood over both sequences. In contrast to the standard log- likelihood objective w.r.t. natural language tokens (word perplexity), optimizing the joint log-likelihood biases the model towards modeling high-level abstractions. We apply the proposed model to the task of dialogue response generation in two challenging domains: the Ubuntu technical support domain, and Twitter conversations. On Ubuntu, the model outperforms competing approaches by a substantial margin, achieving state-of-the-art results according to both automatic evaluation metrics and a human evaluation study. On Twitter, the model appears to generate more relevant and on-topic responses according to automatic evaluation metrics. Finally, our experiments demonstrate that the proposed model is more adept at overcoming the sparsity of natural language and is better able to capture long-term structure.

Results

TaskDatasetMetricValueModel
DialogueUbuntu Dialogue (Activity)F111.43MrRNN Act.-Ent.
DialogueUbuntu Dialogue (Activity)Precision16.84MrRNN Act.-Ent.
DialogueUbuntu Dialogue (Activity)Recall9.72MrRNN Act.-Ent.
DialogueUbuntu Dialogue (Entity)F13.72MrRNN Act.-Ent.
DialogueUbuntu Dialogue (Entity)Precision4.91MrRNN Act.-Ent.
DialogueUbuntu Dialogue (Entity)Recall3.36MrRNN Act.-Ent.
DialogueTwitter Dialogue (Noun)F14.63MrRNN Act.-Ent.
DialogueTwitter Dialogue (Noun)Precision4.82MrRNN Act.-Ent.
DialogueTwitter Dialogue (Noun)Recall5.22MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Activity)F111.43MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Activity)Precision16.84MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Activity)Recall9.72MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Entity)F13.72MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Entity)Precision4.91MrRNN Act.-Ent.
Text GenerationUbuntu Dialogue (Entity)Recall3.36MrRNN Act.-Ent.
Text GenerationTwitter Dialogue (Noun)F14.63MrRNN Act.-Ent.
Text GenerationTwitter Dialogue (Noun)Precision4.82MrRNN Act.-Ent.
Text GenerationTwitter Dialogue (Noun)Recall5.22MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Activity)F111.43MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Activity)Precision16.84MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Activity)Recall9.72MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Entity)F13.72MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Entity)Precision4.91MrRNN Act.-Ent.
ChatbotUbuntu Dialogue (Entity)Recall3.36MrRNN Act.-Ent.
ChatbotTwitter Dialogue (Noun)F14.63MrRNN Act.-Ent.
ChatbotTwitter Dialogue (Noun)Precision4.82MrRNN Act.-Ent.
ChatbotTwitter Dialogue (Noun)Recall5.22MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Activity)F111.43MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Activity)Precision16.84MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Activity)Recall9.72MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Entity)F13.72MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Entity)Precision4.91MrRNN Act.-Ent.
Dialogue GenerationUbuntu Dialogue (Entity)Recall3.36MrRNN Act.-Ent.
Dialogue GenerationTwitter Dialogue (Noun)F14.63MrRNN Act.-Ent.
Dialogue GenerationTwitter Dialogue (Noun)Precision4.82MrRNN Act.-Ent.
Dialogue GenerationTwitter Dialogue (Noun)Recall5.22MrRNN Act.-Ent.

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15ZipVoice-Dialog: Non-Autoregressive Spoken Dialogue Generation with Flow Matching2025-07-12Exploiting Leaderboards for Large-Scale Distribution of Malicious Models2025-07-11