Sebastian Gehrmann, Yuntian Deng, Alexander M. Rush
Neural network-based methods for abstractive summarization produce outputs that are more fluent than other techniques, but which can be poor at content selection. This work proposes a simple technique for addressing this issue: use a data-efficient content selector to over-determine phrases in a source document that should be part of the summary. We use this selector as a bottom-up attention step to constrain the model to likely phrases. We show that this approach improves the ability to compress text, while still generating fluent summaries. This two-step process is both simpler and higher performing than other end-to-end content selection models, leading to significant improvements on ROUGE for both the CNN-DM and NYT corpus. Furthermore, the content selector can be trained with as little as 1,000 sentences, making it easy to transfer a trained summarizer to a new domain.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Generation | Multi-News | ROUGE-1 | 43.57 | CopyTransformer |
| Text Generation | Multi-News | ROUGE-2 | 14.03 | CopyTransformer |
| Text Generation | Multi-News | ROUGE-SU4 | 17.37 | CopyTransformer |
| Text Generation | Multi-News | ROUGE-1 | 42.8 | PG-BRNN |
| Text Generation | Multi-News | ROUGE-2 | 14.19 | PG-BRNN |
| Text Generation | Multi-News | ROUGE-SU4 | 16.75 | PG-BRNN |
| Text Summarization | CNN / Daily Mail | ROUGE-1 | 41.22 | Bottom-Up Summarization |
| Text Summarization | CNN / Daily Mail | ROUGE-2 | 18.68 | Bottom-Up Summarization |
| Text Summarization | CNN / Daily Mail | ROUGE-L | 38.34 | Bottom-Up Summarization |
| Text Summarization | CNN / Daily Mail | PPL | 32.75 | Bottom-Up Sum |
| Text Summarization | CNN / Daily Mail | ROUGE-1 | 41.22 | Bottom-Up Sum |
| Text Summarization | CNN / Daily Mail | ROUGE-2 | 18.68 | Bottom-Up Sum |
| Text Summarization | CNN / Daily Mail | ROUGE-L | 38.34 | Bottom-Up Sum |
| Text Summarization | Multi-News | ROUGE-1 | 43.57 | CopyTransformer |
| Text Summarization | Multi-News | ROUGE-2 | 14.03 | CopyTransformer |
| Text Summarization | Multi-News | ROUGE-SU4 | 17.37 | CopyTransformer |
| Text Summarization | Multi-News | ROUGE-1 | 42.8 | PG-BRNN |
| Text Summarization | Multi-News | ROUGE-2 | 14.19 | PG-BRNN |
| Text Summarization | Multi-News | ROUGE-SU4 | 16.75 | PG-BRNN |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-1 | 41.22 | Bottom-Up Summarization |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-2 | 18.68 | Bottom-Up Summarization |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-L | 38.34 | Bottom-Up Summarization |
| Document Summarization | CNN / Daily Mail | PPL | 32.75 | Bottom-Up Sum |
| Document Summarization | CNN / Daily Mail | ROUGE-1 | 41.22 | Bottom-Up Sum |
| Document Summarization | CNN / Daily Mail | ROUGE-2 | 18.68 | Bottom-Up Sum |
| Document Summarization | CNN / Daily Mail | ROUGE-L | 38.34 | Bottom-Up Sum |