TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Ranking Sentences for Extractive Summarization with Reinfo...

Ranking Sentences for Extractive Summarization with Reinforcement Learning

Shashi Narayan, Shay B. Cohen, Mirella Lapata

2018-02-23NAACL 2018 6Reinforcement LearningExtractive Text SummarizationDocument SummarizationExtractive Summarizationreinforcement-learning
PaperPDFCode(official)

Abstract

Single document summarization is the task of producing a shorter version of a document while preserving its principal information content. In this paper we conceptualize extractive summarization as a sentence ranking task and propose a novel training algorithm which globally optimizes the ROUGE evaluation metric through a reinforcement learning objective. We use our algorithm to train a neural summarization model on the CNN and DailyMail datasets and demonstrate experimentally that it outperforms state-of-the-art extractive and abstractive systems when evaluated automatically and by humans.

Results

TaskDatasetMetricValueModel
Text SummarizationCNN / Daily MailROUGE-140REFRESH
Text SummarizationCNN / Daily MailROUGE-218.2REFRESH
Text SummarizationCNN / Daily MailROUGE-L36.6REFRESH
Extractive Text SummarizationCNN / Daily MailROUGE-140REFRESH
Extractive Text SummarizationCNN / Daily MailROUGE-218.2REFRESH
Extractive Text SummarizationCNN / Daily MailROUGE-L36.6REFRESH

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17