Wan-Ting Hsu, Chieh-Kai Lin, Ming-Ying Lee, Kerui Min, Jing Tang, Min Sun
We propose a unified model combining the strengths of extractive and abstractive summarization. On the one hand, a simple extractive model can obtain sentence-level attention with high ROUGE scores but produces less readable output. On the other hand, a more complicated abstractive model can obtain word-level dynamic attention to generate a more readable paragraph. In our model, sentence-level attention is used to modulate the word-level attention such that words in less attended sentences are less likely to be generated. Moreover, a novel inconsistency loss function is introduced to penalize inconsistency between the two levels of attention. By end-to-end training of our model with the inconsistency loss and the original losses of the extractive and abstractive models, we achieve state-of-the-art ROUGE scores while producing the most informative and readable summaries on the CNN/Daily Mail dataset, as confirmed by a solid human evaluation.
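The two mechanisms named in the abstract, sentence-level attention modulating word-level attention and an inconsistency loss over the two, can be sketched in a few lines. The PyTorch code below is a minimal illustration, not the authors' implementation: the function names, tensor shapes, the top-k value, and the epsilon term are all assumptions made for the sketch. `word_attn` stands for the abstractive decoder's word-level attention at each decoding step, `sent_attn` for the extractor's sentence-level attention, and `word_to_sent` maps each source word to the index of its enclosing sentence.

```python
import torch

def modulate_word_attention(word_attn, sent_attn, word_to_sent, eps=1e-12):
    """Scale each word's attention by its sentence's attention, then
    renormalize, so that words in less attended sentences are less
    likely to be generated.

    word_attn:    (batch, T, M) word-level attention at each decoder step t
    sent_attn:    (batch, N)    sentence-level attention from the extractor
    word_to_sent: (M,)          sentence index n(m) of each source word m
    """
    beta = sent_attn[:, word_to_sent].unsqueeze(1)       # (batch, 1, M)
    scaled = word_attn * beta                            # alpha_t^m * beta_{n(m)}
    return scaled / (scaled.sum(dim=-1, keepdim=True) + eps)

def inconsistency_loss(word_attn, sent_attn, word_to_sent, k=3, eps=1e-12):
    """Penalize decoder steps whose most attended words fall in lowly
    attended sentences: -1/T * sum_t log(mean over the top-k attended
    words of alpha_t^m * beta_{n(m)}). k=3 is an arbitrary choice here."""
    beta = sent_attn[:, word_to_sent].unsqueeze(1).expand_as(word_attn)
    top_idx = word_attn.topk(k, dim=-1).indices          # top-k words by word attention
    prod = word_attn.gather(-1, top_idx) * beta.gather(-1, top_idx)
    return -torch.log(prod.mean(dim=-1) + eps).mean()    # average over steps and batch
```

In the end-to-end setting the abstract describes, this loss term would simply be added to the extractive and abstractive training losses; the sketch accepts whichever word attention (raw or modulated) the caller passes in.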
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-1 | 40.68 | end2end w/ inconsistency loss |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-2 | 17.97 | end2end w/ inconsistency loss |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-L | 37.13 | end2end w/ inconsistency loss |