Yixuan Su, Deng Cai, Qingyu Zhou, Zibo Lin, Simon Baker, Yunbo Cao, Shuming Shi, Nigel Collier, Yan Wang
We study the learning of a matching model for dialogue response selection. Motivated by the recent finding that models trained with random negative samples are not ideal in real-world scenarios, we propose a hierarchical curriculum learning framework that trains the matching model in an "easy-to-difficult" scheme. Our learning framework consists of two complementary curricula: (1) corpus-level curriculum (CC); and (2) instance-level curriculum (IC). In CC, the model gradually increases its ability in finding the matching clues between the dialogue context and a response candidate. As for IC, it progressively strengthens the model's ability in identifying the mismatching information between the dialogue context and a response candidate. Empirical studies on three benchmark datasets with three state-of-the-art matching models demonstrate that the proposed learning framework significantly improves the model performance across various evaluation metrics.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Conversational Response Selection | Douban | MAP | 0.639 | SA-BERT+HCL |
| Conversational Response Selection | Douban | MRR | 0.681 | SA-BERT+HCL |
| Conversational Response Selection | Douban | P@1 | 0.514 | SA-BERT+HCL |
| Conversational Response Selection | Douban | R10@1 | 0.33 | SA-BERT+HCL |
| Conversational Response Selection | Douban | R10@2 | 0.531 | SA-BERT+HCL |
| Conversational Response Selection | Douban | R10@5 | 0.858 | SA-BERT+HCL |
| Conversational Response Selection | RRS | MAP | 0.671 | SA-BERT+HCL |
| Conversational Response Selection | RRS | MRR | 0.683 | SA-BERT+HCL |
| Conversational Response Selection | RRS | P@1 | 0.503 | SA-BERT+HCL |
| Conversational Response Selection | RRS | R10@1 | 0.454 | SA-BERT+HCL |
| Conversational Response Selection | RRS | R10@2 | 0.659 | SA-BERT+HCL |
| Conversational Response Selection | RRS | R10@5 | 0.917 | SA-BERT+HCL |
| Conversational Response Selection | E-commerce | R10@1 | 0.721 | SA-BERT+HCL |
| Conversational Response Selection | E-commerce | R10@2 | 0.896 | SA-BERT+HCL |
| Conversational Response Selection | E-commerce | R10@5 | 0.993 | SA-BERT+HCL |