TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Reinforced Topic-Aware Convolutional Sequence-to-Sequenc...

A Reinforced Topic-Aware Convolutional Sequence-to-Sequence Model for Abstractive Text Summarization

Li Wang, Junlin Yao, Yunzhe Tao, Li Zhong, Wei Liu, Qiang Du

2018-05-09Abstractive Text SummarizationText SummarizationInformativeness
PaperPDF

Abstract

In this paper, we propose a deep learning approach to tackle the automatic summarization tasks by incorporating topic information into the convolutional sequence-to-sequence (ConvS2S) model and using self-critical sequence training (SCST) for optimization. Through jointly attending to topics and word-level alignment, our approach can improve coherence, diversity, and informativeness of generated summaries via a biased probability generation mechanism. On the other hand, reinforcement training, like SCST, directly optimizes the proposed model with respect to the non-differentiable metric ROUGE, which also avoids the exposure bias during inference. We carry out the experimental evaluation with state-of-the-art methods over the Gigaword, DUC-2004, and LCSTS datasets. The empirical results demonstrate the superiority of our proposed method in the abstractive summarization.

Results

TaskDatasetMetricValueModel
Text SummarizationDUC 2004 Task 1ROUGE-131.15Reinforced-Topic-ConvS2S
Text SummarizationDUC 2004 Task 1ROUGE-210.85Reinforced-Topic-ConvS2S
Text SummarizationDUC 2004 Task 1ROUGE-L27.68Reinforced-Topic-ConvS2S
Text SummarizationGigaWordROUGE-136.92Reinforced-Topic-ConvS2S
Text SummarizationGigaWordROUGE-218.29Reinforced-Topic-ConvS2S
Text SummarizationGigaWordROUGE-L34.58Reinforced-Topic-ConvS2S

Related Papers

LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification2025-07-15Multi-Agent Retrieval-Augmented Framework for Evidence-Based Counterspeech Against Health Misinformation2025-07-09LumiCRS: Asymmetric Contrastive Prototype Learning for Long-Tail Conversational Movie Recommendation2025-07-07Dynamic Bandwidth Allocation for Hybrid Event-RGB Transmission2025-06-25Multi-Preference Lambda-weighted Listwise DPO for Dynamic Preference Alignment2025-06-24On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention2025-06-11Improving large language models with concept-aware fine-tuning2025-06-09CuRe: Cultural Gaps in the Long Tail of Text-to-Image Systems2025-06-09