Improved Deep Learning Baselines for Ubuntu Corpus Dialogs

Rudolf Kadlec, Martin Schmid, Jan Kleindienst

2015-10-13
Deep Learning, Conversational Response Selection

Abstract

This paper presents the results of our experiments on next utterance ranking on the Ubuntu Dialog Corpus, the largest publicly available multi-turn dialog corpus. First, we use an in-house implementation of previously reported models to perform an independent evaluation on the same data. Second, we evaluate the performance of various LSTMs, Bi-LSTMs and CNNs on the dataset. Third, we create an ensemble by averaging the predictions of multiple models. The ensemble further improves performance and achieves a state-of-the-art result for next utterance ranking on this dataset. Finally, we discuss our future plans for this corpus.
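
The ensembling step mentioned in the abstract (averaging the predictions of multiple models) could look roughly like the sketch below. This is only an illustration under stated assumptions, not the authors' implementation: the function names `ensemble_average` and `rank_candidates` are hypothetical, and each model is assumed to output one score per candidate response, with higher scores meaning a better match.

```python
import numpy as np


def ensemble_average(model_scores):
    """Average per-candidate scores across models.

    model_scores: array-like of shape (n_models, n_examples, n_candidates).
    Returns an array of shape (n_examples, n_candidates).
    (Hypothetical helper; unweighted averaging is an assumption.)
    """
    scores = np.asarray(model_scores, dtype=float)
    return scores.mean(axis=0)


def rank_candidates(avg_scores):
    """Return candidate indices sorted from highest to lowest averaged score."""
    return np.argsort(-np.asarray(avg_scores, dtype=float), axis=1)


# Two hypothetical models scoring two candidates for a single context.
scores = [
    [[0.2, 0.8]],  # model A
    [[0.6, 0.4]],  # model B
]
averaged = ensemble_average(scores)
ranking = rank_candidates(averaged)
```

An unweighted mean is the simplest reading of "averaging predictions"; a weighted average or a selected subset of models would be a different design choice.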

Results

Task	Dataset	Metric	Value	Model
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	R10@1	0.63	Dual-BiLSTM
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	R10@2	0.78	Dual-BiLSTM
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	R10@5	0.944	Dual-BiLSTM
Conversational Response Selection	Ubuntu Dialogue (v1, Ranking)	R2@1	0.895	Dual-BiLSTM
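
The metrics in the table are usually read as recall at k over n candidates: R10@1 is the fraction of test examples in which the single correct response is ranked first among 10 candidates, and R2@1 is the same with 2 candidates. The sketch below shows one way to compute such a metric. The function name `recall_at_k` is hypothetical, and the tie handling (ties count in favor of the correct response) is an assumption rather than something the source specifies.

```python
import numpy as np


def recall_at_k(scores, true_index, k):
    """Fraction of examples whose correct response is ranked in the top k.

    scores: array-like of shape (n_examples, n_candidates), higher is better.
    true_index: array-like of shape (n_examples,), column of the correct response.
    (Hypothetical helper; assumes exactly one correct response per example.)
    """
    scores = np.asarray(scores, dtype=float)
    true_index = np.asarray(true_index)
    true_scores = scores[np.arange(len(scores)), true_index]
    # Zero-based rank: how many candidates score strictly higher than the
    # correct one. Ties therefore count in favor of the correct response.
    rank = (scores > true_scores[:, None]).sum(axis=1)
    return float((rank < k).mean())


# Two toy examples with three candidates each.
toy_scores = [[0.9, 0.1, 0.5],
              [0.2, 0.7, 0.3]]
toy_truth = [0, 2]
r_at_1 = recall_at_k(toy_scores, toy_truth, k=1)
r_at_2 = recall_at_k(toy_scores, toy_truth, k=2)
```

In the toy data the correct response is ranked first in the first example and second in the second, so recall rises from 0.5 at k=1 to 1.0 at k=2.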
