TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/A Discourse-Aware Attention Model for Abstractive Summariz...

A Discourse-Aware Attention Model for Abstractive Summarization of Long Documents

Arman Cohan, Franck Dernoncourt, Doo Soon Kim, Trung Bui, Seokhwan Kim, Walter Chang, Nazli Goharian

2018-04-16NAACL 2018 6Unsupervised Extractive SummarizationAbstractive Text SummarizationText Summarization
PaperPDFCode(official)CodeCode

Abstract

Neural abstractive summarization models have led to promising results in summarizing relatively short documents. We propose the first model for abstractive summarization of single, longer-form documents (e.g., research papers). Our approach consists of a new hierarchical encoder that models the discourse structure of a document, and an attentive discourse-aware decoder to generate the summary. Empirical results on two large-scale datasets of scientific papers show that our model significantly outperforms state-of-the-art models.

Results

TaskDatasetMetricValueModel
SummarizationarXiv Summarization DatasetROUGE-133.85LexRank
SummarizationarXiv Summarization DatasetROUGE-210.73LexRank
SummarizationarXiv Summarization DatasetROUGE-L28.99LexRank
SummarizationarXiv Summarization DatasetROUGE-129.91LSA
SummarizationarXiv Summarization DatasetROUGE-27.42LSA
SummarizationarXiv Summarization DatasetROUGE-L25.67LSA
SummarizationarXiv Summarization DatasetROUGE-129.47SumBasic
SummarizationarXiv Summarization DatasetROUGE-26.95SumBasic
SummarizationarXiv Summarization DatasetROUGE-L26.3SumBasic
SummarizationPubmedROUGE-139.19LexRank
SummarizationPubmedROUGE-213.89LexRank
SummarizationPubmedROUGE-L34.59LexRank
SummarizationPubmedROUGE-137.15SumBasic
SummarizationPubmedROUGE-211.36SumBasic
SummarizationPubmedROUGE-L33.43SumBasic
SummarizationPubmedROUGE-133.89LSA
SummarizationPubmedROUGE-29.93LSA
SummarizationPubmedROUGE-L29.7LSA
Text SummarizationArxiv HEP-TH citation graphROUGE-135.8Discourse
Text SummarizationPubmedROUGE-138.93Discourse

Related Papers

LRCTI: A Large Language Model-Based Framework for Multi-Step Evidence Retrieval and Reasoning in Cyber Threat Intelligence Credibility Verification2025-07-15On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention2025-06-11Improving large language models with concept-aware fine-tuning2025-06-09Advancing Decoding Strategies: Enhancements in Locally Typical Sampling for LLMs2025-06-03ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs2025-05-29MaCP: Minimal yet Mighty Adaptation via Hierarchical Cosine Projection2025-05-29APE: A Data-Centric Benchmark for Efficient LLM Adaptation in Text Summarization2025-05-26FiLLM -- A Filipino-optimized Large Language Model based on Southeast Asia Large Language Model (SEALLM)2025-05-25