TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Don't Give Me the Details, Just the Summary! Topic-Aware C...

Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization

Shashi Narayan, Shay B. Cohen, Mirella Lapata

2018-08-27EMNLP 2018 10Text SummarizationDocument SummarizationExtreme Summarization
PaperPDFCodeCodeCode(official)

Abstract

We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?". We collect a real-world, large-scale dataset for this task by harvesting online articles from the British Broadcasting Corporation (BBC). We propose a novel abstractive model which is conditioned on the article's topics and based entirely on convolutional neural networks. We demonstrate experimentally that this architecture captures long-range dependencies in a document and recognizes pertinent content, outperforming an oracle extractive system and state-of-the-art abstractive approaches when evaluated automatically and by humans.

Results

TaskDatasetMetricValueModel
Text SummarizationX-SumROUGE-131.89T-ConvS2S
Text SummarizationX-SumROUGE-211.54T-ConvS2S
Text SummarizationX-SumROUGE-325.75T-ConvS2S
Text SummarizationX-SumROUGE-129.79Baseline : Extractive Oracle
Text SummarizationX-SumROUGE-28.81Baseline : Extractive Oracle
Text SummarizationX-SumROUGE-322.66Baseline : Extractive Oracle
Text SummarizationX-SumROUGE-129.7PtGen
Text SummarizationX-SumROUGE-29.21PtGen
Text SummarizationX-SumROUGE-323.24PtGen
Text SummarizationX-SumROUGE-128.42Seq2Seq
Text SummarizationX-SumROUGE-28.77Seq2Seq
Text SummarizationX-SumROUGE-322.48Seq2Seq
Text SummarizationX-SumROUGE-128.1PtGen-Covg
Text SummarizationX-SumROUGE-28.02PtGen-Covg
Text SummarizationX-SumROUGE-321.72PtGen-Covg
Text SummarizationX-SumROUGE-116.3Baseline : Lead-3
Text SummarizationX-SumROUGE-21.6Baseline : Lead-3
Text SummarizationX-SumROUGE-311.95Baseline : Lead-3
Text SummarizationX-SumROUGE-115.16Baseline : Random
Text SummarizationX-SumROUGE-21.78Baseline : Random
Text SummarizationX-SumROUGE-311.27Baseline : Random

Related Papers

LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification2025-07-15GenerationPrograms: Fine-grained Attribution with Executable Programs2025-06-17Arctic Long Sequence Training: Scalable And Efficient Training For Multi-Million Token Sequences2025-06-16On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention2025-06-11Improving large language models with concept-aware fine-tuning2025-06-09Improving Fairness of Large Language Models in Multi-document Summarization2025-06-09MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection2025-05-29ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs2025-05-29