Improved Deep Learning Baselines for Ubuntu Corpus Dialogs
Rudolf Kadlec, Martin Schmid, Jan Kleindienst
Abstract
This paper presents results of our experiments for the next utterance ranking on the Ubuntu Dialog Corpus -- the largest publicly available multi-turn dialog corpus. First, we use an in-house implementation of previously reported models to do an independent evaluation using the same data. Second, we evaluate the performances of various LSTMs, Bi-LSTMs and CNNs on the dataset. Third, we create an ensemble by averaging predictions of multiple models. The ensemble further improves the performance and it achieves a state-of-the-art result for the next utterance ranking on this dataset. Finally, we discuss our future plans using this corpus.
Results
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@1 | 0.63 | Dual-BiLSTM |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@2 | 0.78 | Dual-BiLSTM |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@5 | 0.944 | Dual-BiLSTM |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R2@1 | 0.895 | Dual-BiLSTM |
Related Papers
Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18A Survey of Deep Learning for Geometry Problem Solving2025-07-16Uncertainty Quantification for Motor Imagery BCI -- Machine Learning vs. Deep Learning2025-07-10Chat-Ghosting: A Comparative Study of Methods for Auto-Completion in Dialog Systems2025-07-08Deep Learning Optimization of Two-State Pinching Antennas Systems2025-07-08AXLearn: Modular Large Model Training on Heterogeneous Infrastructure2025-07-07Determination Of Structural Cracks Using Deep Learning Frameworks2025-07-03Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains2025-07-02