TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking ...

SummaReranker: A Multi-Task Mixture-of-Experts Re-ranking Framework for Abstractive Summarization

Mathieu Ravaut, Shafiq Joty, Nancy F. Chen

2022-03-13ACL 2022 5Abstractive Text SummarizationText SummarizationDocument SummarizationRe-Ranking
PaperPDFCode(official)

Abstract

Sequence-to-sequence neural networks have recently achieved great success in abstractive summarization, especially through fine-tuning large pre-trained language models on the downstream dataset. These models are typically decoded with beam search to generate a unique summary. However, the search space is very large, and with the exposure bias, such decoding is not optimal. In this paper, we show that it is possible to directly train a second-stage model performing re-ranking on a set of summary candidates. Our mixture-of-experts SummaReranker learns to select a better candidate and consistently improves the performance of the base model. With a base PEGASUS, we push ROUGE scores by 5.44% on CNN-DailyMail (47.16 ROUGE-1), 1.31% on XSum (48.12 ROUGE-1) and 9.34% on Reddit TIFU (29.83 ROUGE-1), reaching a new state-of-the-art. Our code and checkpoints will be available at https://github.com/ntunlp/SummaReranker.

Results

TaskDatasetMetricValueModel
Text SummarizationReddit TIFUROUGE-129.83PEGASUS + SummaReranker
Text SummarizationReddit TIFUROUGE-29.5PEGASUS + SummaReranker
Text SummarizationReddit TIFUROUGE-L23.47PEGASUS + SummaReranker
Text SummarizationX-SumROUGE-148.12PEGASUS + SummaReranker
Text SummarizationX-SumROUGE-224.95PEGASUS + SummaReranker
Text SummarizationX-SumROUGE-L40PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-147.16PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-222.61PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-L43.87PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-147.16PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-222.55PEGASUS + SummaReranker
Text SummarizationCNN / Daily MailROUGE-L43.87PEGASUS + SummaReranker
Abstractive Text SummarizationCNN / Daily MailROUGE-147.16PEGASUS + SummaReranker
Abstractive Text SummarizationCNN / Daily MailROUGE-222.61PEGASUS + SummaReranker
Abstractive Text SummarizationCNN / Daily MailROUGE-L43.87PEGASUS + SummaReranker
Document SummarizationCNN / Daily MailROUGE-147.16PEGASUS + SummaReranker
Document SummarizationCNN / Daily MailROUGE-222.55PEGASUS + SummaReranker
Document SummarizationCNN / Daily MailROUGE-L43.87PEGASUS + SummaReranker

Related Papers

Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17MCoT-RE: Multi-Faceted Chain-of-Thought and Re-Ranking for Training-Free Zero-Shot Composed Image Retrieval2025-07-17LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification2025-07-15CATVis: Context-Aware Thought Visualization2025-07-15SAMURAI: Shape-Aware Multimodal Retrieval for 3D Object Identification2025-06-26RAG-VisualRec: An Open Resource for Vision- and Text-Enhanced Retrieval-Augmented Generation in Recommendation2025-06-25IRanker: Towards Ranking Foundation Model2025-06-25GenerationPrograms: Fine-grained Attribution with Executable Programs2025-06-17