Zhuosheng Zhang, Jiangtong Li, Pengfei Zhu, Hai Zhao, Gongshen Liu
Multi-turn conversation understanding is a major challenge for building intelligent dialogue systems. This work focuses on retrieval-based response matching for multi-turn conversation whose related work simply concatenates the conversation utterances, ignoring the interactions among previous utterances for context modeling. In this paper, we formulate previous utterances into context using a proposed deep utterance aggregation model to form a fine-grained context representation. In detail, a self-matching attention is first introduced to route the vital information in each utterance. Then the model matches a response with each refined utterance and the final matching score is obtained after attentive turns aggregation. Experimental results show our model outperforms the state-of-the-art methods on three multi-turn conversation benchmarks, including a newly introduced e-commerce dialogue corpus.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Conversational Response Selection | Douban | MAP | 0.551 | DUA |
| Conversational Response Selection | Douban | MRR | 0.599 | DUA |
| Conversational Response Selection | Douban | P@1 | 0.421 | DUA |
| Conversational Response Selection | Douban | R10@1 | 0.243 | DUA |
| Conversational Response Selection | Douban | R10@2 | 0.421 | DUA |
| Conversational Response Selection | Douban | R10@5 | 0.78 | DUA |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@1 | 0.752 | DUA |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@2 | 0.868 | DUA |
| Conversational Response Selection | Ubuntu Dialogue (v1, Ranking) | R10@5 | 0.962 | DUA |
| Conversational Response Selection | E-commerce | R10@1 | 0.501 | DUA |
| Conversational Response Selection | E-commerce | R10@2 | 0.7 | DUA |
| Conversational Response Selection | E-commerce | R10@5 | 0.921 | DUA |