Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Discourse-Aware Unsupervised Summarization of Long Scientific Documents

Yue Dong, Andrei Mircea, Jackie C. K. Cheung

2020-05-01 | Unsupervised Extractive Summarization, Extractive Summarization

Abstract

We propose an unsupervised graph-based ranking model for extractive summarization of long scientific documents. Our method assumes a two-level hierarchical graph representation of the source document, and exploits asymmetrical positional cues to determine sentence importance. Results on the PubMed and arXiv datasets show that our approach outperforms strong unsupervised baselines by wide margins in automatic metrics and human evaluation. In addition, it achieves performance comparable to many state-of-the-art supervised approaches which are trained on hundreds of thousands of examples. These results suggest that patterns in the discourse structure are a strong signal for determining importance in scientific articles.
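
The idea in the abstract (a two-level graph over sections and sentences, with edge weights that depend on position) can be illustrated with a minimal sketch. This is not the authors' HipoRank implementation: the Jaccard word-overlap similarity, the names near_weight, far_weight and section_mix, and the exact weighting rule are all assumptions chosen for illustration. The sketch gives more weight to edges that point at sentences (or sections) close to the start or end of their enclosing sequence.

```python
from typing import List, Tuple


def jaccard(a: str, b: str) -> float:
    """Word-overlap similarity; a stand-in for any sentence similarity measure."""
    sa, sb = set(a.lower().split()), set(b.lower().split())
    if not sa or not sb:
        return 0.0
    return len(sa & sb) / len(sa | sb)


def boundary_distance(index: int, length: int) -> int:
    """Distance from a position to the nearest end of its sequence."""
    return min(index, length - 1 - index)


def rank_sentences(
    sections: List[List[str]],
    near_weight: float = 1.0,   # hypothetical: scored node is at least as close to a boundary
    far_weight: float = 0.5,    # hypothetical: scored node is further from a boundary
    section_mix: float = 0.5,   # hypothetical: relative weight of the section-level term
) -> List[Tuple[float, int, int]]:
    """Return (score, section_index, sentence_index), highest score first."""
    section_texts = [" ".join(s) for s in sections]
    n_sections = len(sections)
    scored = []
    for si, sentences in enumerate(sections):
        n = len(sentences)
        for i, sent in enumerate(sentences):
            # Level 1: edges between sentences of the same section.
            intra = 0.0
            for j, other in enumerate(sentences):
                if j == i:
                    continue
                closer = boundary_distance(i, n) <= boundary_distance(j, n)
                intra += (near_weight if closer else far_weight) * jaccard(sent, other)
            # Level 2: edges between this sentence and the other sections.
            inter = 0.0
            for sj, text in enumerate(section_texts):
                if sj == si:
                    continue
                closer = (boundary_distance(si, n_sections)
                          <= boundary_distance(sj, n_sections))
                inter += (near_weight if closer else far_weight) * jaccard(sent, text)
            intra /= max(n - 1, 1)
            inter /= max(n_sections - 1, 1)
            scored.append((intra + section_mix * inter, si, i))
    scored.sort(key=lambda t: (-t[0], t[1], t[2]))
    return scored


def summarize(sections: List[List[str]], k: int = 2) -> List[str]:
    """Pick the k highest-scoring sentences and return them in document order."""
    top = rank_sentences(sections)[:k]
    return [sections[si][i] for _, si, i in sorted(top, key=lambda t: (t[1], t[2]))]
```

The weighting is asymmetric because the weight on the edge between two sentences depends on which of them is being scored, so a sentence near a section boundary gains more from a shared edge than a sentence in the middle does. No training data is involved at any point, which is what makes the approach unsupervised.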

Results

Task | Dataset | Metric | Value | Model
Summarization | arXiv Summarization Dataset | ROUGE-1 | 39.34 | HipoRank
Summarization | arXiv Summarization Dataset | ROUGE-2 | 12.56 | HipoRank
Summarization | arXiv Summarization Dataset | ROUGE-L | 34.89 | HipoRank
Summarization | arXiv Summarization Dataset | ROUGE-1 | 38.57 | PacSum
Summarization | arXiv Summarization Dataset | ROUGE-2 | 10.93 | PacSum
Summarization | arXiv Summarization Dataset | ROUGE-L | 34.33 | PacSum
Summarization | PubMed | ROUGE-1 | 43.58 | HipoRank
Summarization | PubMed | ROUGE-2 | 17 | HipoRank
Summarization | PubMed | ROUGE-L | 39.31 | HipoRank
Summarization | PubMed | ROUGE-1 | 39.79 | PacSum
Summarization | PubMed | ROUGE-2 | 14 | PacSum
Summarization | PubMed | ROUGE-L | 36.09 | PacSum
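
For readers unfamiliar with the metrics, ROUGE-1 and ROUGE-2 measure unigram and bigram overlap between a generated summary and a reference summary. The sketch below is a simplified ROUGE-N F1 that assumes whitespace tokenization and applies no stemming or stopword handling, so its values will not match the official ROUGE toolkit exactly. It also assumes the figures above are F1 scores on a 0 to 100 scale, which this page does not state. The function names are illustrative.

```python
from collections import Counter
from typing import List


def ngram_counts(tokens: List[str], n: int) -> Counter:
    """Count the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n_f1(candidate: str, reference: str, n: int = 1) -> float:
    """Simplified ROUGE-N F1 in the range 0 to 1 (multiply by 100 for percent)."""
    cand = ngram_counts(candidate.lower().split(), n)
    ref = ngram_counts(reference.lower().split(), n)
    # Counter intersection keeps the minimum count of each shared n-gram,
    # which gives the clipped overlap.
    overlap = sum((cand & ref).values())
    if overlap == 0:
        return 0.0
    precision = overlap / sum(cand.values())
    recall = overlap / sum(ref.values())
    return 2 * precision * recall / (precision + recall)
```

ROUGE-L, the third metric in the table, is based on the longest common subsequence rather than fixed-length n-grams and is not covered by this sketch.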

Related Papers

StrucSum: Graph-Structured Reasoning for Long Document Extractive Summarization with LLMs (2025-05-29)
SafeChat: A Framework for Building Trustworthy Collaborative Assistants and a Case Study of its Usefulness (2025-04-08)
Advancements in Natural Language Processing for Automatic Text Summarization (2025-02-27)
OrderSum: Semantic Sentence Ordering for Extractive Summarization (2025-02-22)
Lotus: Creating Short Videos From Long Videos With Abstractive and Extractive Summarization (2025-02-10)
State Space Models for Extractive Summarization in Low Resource Scenarios (2025-01-24)
CHIMA: Headline-Guided Extractive Summarization for Thai News Articles (2024-12-02)
A Novel Word Pair-based Gaussian Sentence Similarity Algorithm For Bengali Extractive Text Summarization (2024-11-26)