Shashi Narayan, Shay B. Cohen, Mirella Lapata
We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?". We collect a real-world, large-scale dataset for this task by harvesting online articles from the British Broadcasting Corporation (BBC). We propose a novel abstractive model which is conditioned on the article's topics and based entirely on convolutional neural networks. We demonstrate experimentally that this architecture captures long-range dependencies in a document and recognizes pertinent content, outperforming an oracle extractive system and state-of-the-art abstractive approaches when evaluated automatically and by humans.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Summarization | X-Sum | ROUGE-1 | 31.89 | T-ConvS2S |
| Text Summarization | X-Sum | ROUGE-2 | 11.54 | T-ConvS2S |
| Text Summarization | X-Sum | ROUGE-3 | 25.75 | T-ConvS2S |
| Text Summarization | X-Sum | ROUGE-1 | 29.79 | Baseline : Extractive Oracle |
| Text Summarization | X-Sum | ROUGE-2 | 8.81 | Baseline : Extractive Oracle |
| Text Summarization | X-Sum | ROUGE-3 | 22.66 | Baseline : Extractive Oracle |
| Text Summarization | X-Sum | ROUGE-1 | 29.7 | PtGen |
| Text Summarization | X-Sum | ROUGE-2 | 9.21 | PtGen |
| Text Summarization | X-Sum | ROUGE-3 | 23.24 | PtGen |
| Text Summarization | X-Sum | ROUGE-1 | 28.42 | Seq2Seq |
| Text Summarization | X-Sum | ROUGE-2 | 8.77 | Seq2Seq |
| Text Summarization | X-Sum | ROUGE-3 | 22.48 | Seq2Seq |
| Text Summarization | X-Sum | ROUGE-1 | 28.1 | PtGen-Covg |
| Text Summarization | X-Sum | ROUGE-2 | 8.02 | PtGen-Covg |
| Text Summarization | X-Sum | ROUGE-3 | 21.72 | PtGen-Covg |
| Text Summarization | X-Sum | ROUGE-1 | 16.3 | Baseline : Lead-3 |
| Text Summarization | X-Sum | ROUGE-2 | 1.6 | Baseline : Lead-3 |
| Text Summarization | X-Sum | ROUGE-3 | 11.95 | Baseline : Lead-3 |
| Text Summarization | X-Sum | ROUGE-1 | 15.16 | Baseline : Random |
| Text Summarization | X-Sum | ROUGE-2 | 1.78 | Baseline : Random |
| Text Summarization | X-Sum | ROUGE-3 | 11.27 | Baseline : Random |