Yixin Liu, PengFei Liu
In this paper, we present a conceptually simple while empirically powerful framework for abstractive summarization, SimCLS, which can bridge the gap between the learning objective and evaluation metrics resulting from the currently dominated sequence-to-sequence learning framework by formulating text generation as a reference-free evaluation problem (i.e., quality estimation) assisted by contrastive learning. Experimental results show that, with minor modification over existing top-scoring systems, SimCLS can improve the performance of existing top-performing models by a large margin. Particularly, 2.51 absolute improvement against BART and 2.50 over PEGASUS w.r.t ROUGE-1 on the CNN/DailyMail dataset, driving the state-of-the-art performance to a new level. We have open-sourced our codes and results: https://github.com/yixinL7/SimCLS. Results of our proposed models have been deployed into ExplainaBoard platform, which allows researchers to understand our systems in a more fine-grained way.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Text Summarization | X-Sum | ROUGE-1 | 47.61 | PEGASUS + SimCLS |
| Text Summarization | X-Sum | ROUGE-2 | 24.57 | PEGASUS + SimCLS |
| Text Summarization | X-Sum | ROUGE-L | 39.44 | PEGASUS + SimCLS |
| Text Summarization | CNN / Daily Mail | ROUGE-1 | 46.67 | BART + SimCLS |
| Text Summarization | CNN / Daily Mail | ROUGE-2 | 22.15 | BART + SimCLS |
| Text Summarization | CNN / Daily Mail | ROUGE-L | 43.54 | BART + SimCLS |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-1 | 46.67 | BART + SimCLS |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-2 | 22.15 | BART + SimCLS |
| Abstractive Text Summarization | CNN / Daily Mail | ROUGE-L | 43.54 | BART + SimCLS |