SARG: A Novel Semi Autoregressive Generator for Multi-turn Incomplete Utterance Restoration

Mengzuo Huang, Feng Li, Wuhe Zou, Weidong Zhang

2020-08-04Text Generation Dialogue Rewriting

Abstract

Dialogue systems in open domain have achieved great success due to the easily obtained single-turn corpus and the development of deep learning, but the multi-turn scenario is still a challenge because of the frequent coreference and information omission. In this paper, we investigate the incomplete utterance restoration which has brought general improvement over multi-turn dialogue systems in recent studies. Meanwhile, jointly inspired by the autoregression for text generation and the sequence labeling for text editing, we propose a novel semi autoregressive generator (SARG) with the high efficiency and flexibility. Moreover, experiments on two benchmarks show that our proposed model significantly outperforms the state-of-the-art models in terms of quality and inference speed.

Results

Task	Dataset	Metric	Value	Model
Dialogue Rewriting	Multi-Rewrite	Rewriting F2	52.5	SARG (n_beam=5)
Dialogue Rewriting	Multi-Rewrite	Rewriting F3	46.4	SARG (n_beam=5)
Dialogue Rewriting	Multi-Rewrite	BLEU-1	92.2	SARG (greedy)
Dialogue Rewriting	Multi-Rewrite	BLEU-2	89.6	SARG (greedy)
Dialogue Rewriting	Multi-Rewrite	ROUGE-1	92.1	SARG (greedy)
Dialogue Rewriting	Multi-Rewrite	ROUGE-2	86	SARG (greedy)
Dialogue Rewriting	Multi-Rewrite	Rewriting F1	62.4	SARG (greedy)
Dialogue Rewriting	CANARD	BLEU	54.8	SARG

Related Papers

Making Language Model a Hierarchical Classifier and Generator2025-07-17 Mitigating Object Hallucinations via Sentence-Level Early Intervention2025-07-16 The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs2025-07-15 Seq vs Seq: An Open Suite of Paired Encoders and Decoders2025-07-15 Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15 Exploiting Leaderboards for Large-Scale Distribution of Malicious Models2025-07-11 CLI-RAG: A Retrieval-Augmented Framework for Clinically Structured and Context Aware Text Generation with LLMs2025-07-09 FIFA: Unified Faithfulness Evaluation Framework for Text-to-Video and Video-to-Text Generation2025-07-09