TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/EmailSum: Abstractive Email Thread Summarization

EmailSum: Abstractive Email Thread Summarization

Shiyue Zhang, Asli Celikyilmaz, Jianfeng Gao, Mohit Bansal

2021-07-30ACL 2021 5Abstractive Text SummarizationEmail Thread Summarization
PaperPDFCode(official)

Abstract

Recent years have brought about an interest in the challenging task of summarizing conversation threads (meetings, online discussions, etc.). Such summaries help analysis of the long text to quickly catch up with the decisions made and thus improve our work or communication efficiency. To spur research in thread summarization, we have developed an abstractive Email Thread Summarization (EmailSum) dataset, which contains human-annotated short (<30 words) and long (<100 words) summaries of 2549 email threads (each containing 3 to 10 emails) over a wide variety of topics. We perform a comprehensive empirical study to explore different summarization techniques (including extractive and abstractive methods, single-document and hierarchical models, as well as transfer and semisupervised learning) and conduct human evaluations on both short and long summary generation tasks. Our results reveal the key challenges of current abstractive summarization models in this task, such as understanding the sender's intent and identifying the roles of sender and receiver. Furthermore, we find that widely used automatic evaluation metrics (ROUGE, BERTScore) are weakly correlated with human judgments on this email thread summarization task. Hence, we emphasize the importance of human evaluation and the development of better metrics by the community. Our code and summary data have been made available at: https://github.com/ZhangShiyue/EmailSum

Results

TaskDatasetMetricValueModel
Text SummarizationEmailSum (short)BertS33.91SemiSuptogether
Text SummarizationEmailSum (short)RLsum33.7SemiSuptogether
Text SummarizationEmailSum (short)ROUGE-136.98SemiSuptogether
Text SummarizationEmailSum (short)ROUGE-211.21SemiSuptogether
Text SummarizationEmailSum (short)ROUGE-L28.76SemiSuptogether
Text SummarizationEmailSum (short)BertS33.9T5base
Text SummarizationEmailSum (short)RLsum32.76T5base
Text SummarizationEmailSum (short)ROUGE-136.57T5base
Text SummarizationEmailSum (short)ROUGE-210.56T5base
Text SummarizationEmailSum (short)ROUGE-L28.3T5base
Text SummarizationEmailSum (short)BertS22.32Oracle
Text SummarizationEmailSum (short)RLsum35.61Oracle
Text SummarizationEmailSum (short)ROUGE-139.04Oracle
Text SummarizationEmailSum (short)ROUGE-212.47Oracle
Text SummarizationEmailSum (short)ROUGE-L30.17Oracle
Text SummarizationEmailSum (long)BertS32.3SemiSuptogether
Text SummarizationEmailSum (long)RLsum40.67SemiSuptogether
Text SummarizationEmailSum (long)ROUGE-144.08SemiSuptogether
Text SummarizationEmailSum (long)ROUGE-214.06SemiSuptogether
Text SummarizationEmailSum (long)ROUGE-L31.17SemiSuptogether
Text SummarizationEmailSum (long)BertS32.09T5base
Text SummarizationEmailSum (long)RLsum39.88T5base
Text SummarizationEmailSum (long)ROUGE-143.81T5base
Text SummarizationEmailSum (long)ROUGE-214.08T5base
Text SummarizationEmailSum (long)ROUGE-L30.47T5base
Text SummarizationEmailSum (long)BertS26.31Oracle
Text SummarizationEmailSum (long)RLsum42.14Oracle
Text SummarizationEmailSum (long)ROUGE-145.98Oracle
Text SummarizationEmailSum (long)ROUGE-215.49Oracle
Text SummarizationEmailSum (long)ROUGE-L32.4Oracle
Document SummarizationEmailSum (short)BertS33.91SemiSuptogether
Document SummarizationEmailSum (short)RLsum33.7SemiSuptogether
Document SummarizationEmailSum (short)ROUGE-136.98SemiSuptogether
Document SummarizationEmailSum (short)ROUGE-211.21SemiSuptogether
Document SummarizationEmailSum (short)ROUGE-L28.76SemiSuptogether
Document SummarizationEmailSum (short)BertS33.9T5base
Document SummarizationEmailSum (short)RLsum32.76T5base
Document SummarizationEmailSum (short)ROUGE-136.57T5base
Document SummarizationEmailSum (short)ROUGE-210.56T5base
Document SummarizationEmailSum (short)ROUGE-L28.3T5base
Document SummarizationEmailSum (short)BertS22.32Oracle
Document SummarizationEmailSum (short)RLsum35.61Oracle
Document SummarizationEmailSum (short)ROUGE-139.04Oracle
Document SummarizationEmailSum (short)ROUGE-212.47Oracle
Document SummarizationEmailSum (short)ROUGE-L30.17Oracle
Document SummarizationEmailSum (long)BertS32.3SemiSuptogether
Document SummarizationEmailSum (long)RLsum40.67SemiSuptogether
Document SummarizationEmailSum (long)ROUGE-144.08SemiSuptogether
Document SummarizationEmailSum (long)ROUGE-214.06SemiSuptogether
Document SummarizationEmailSum (long)ROUGE-L31.17SemiSuptogether
Document SummarizationEmailSum (long)BertS32.09T5base
Document SummarizationEmailSum (long)RLsum39.88T5base
Document SummarizationEmailSum (long)ROUGE-143.81T5base
Document SummarizationEmailSum (long)ROUGE-214.08T5base
Document SummarizationEmailSum (long)ROUGE-L30.47T5base
Document SummarizationEmailSum (long)BertS26.31Oracle
Document SummarizationEmailSum (long)RLsum42.14Oracle
Document SummarizationEmailSum (long)ROUGE-145.98Oracle
Document SummarizationEmailSum (long)ROUGE-215.49Oracle
Document SummarizationEmailSum (long)ROUGE-L32.4Oracle

Related Papers

Advancing Decoding Strategies: Enhancements in Locally Typical Sampling for LLMs2025-06-03ARC: Argument Representation and Coverage Analysis for Zero-Shot Long Document Summarization with Instruction Following LLMs2025-05-29Power-Law Decay Loss for Large Language Model Finetuning: Focusing on Information Sparsity to Enhance Generation Quality2025-05-22Enhancing Abstractive Summarization of Scientific Papers Using Structure Information2025-05-20Low-Resource Language Processing: An OCR-Driven Summarization and Translation Pipeline2025-05-16ProdRev: A DNN framework for empowering customers using generative pre-trained transformers2025-05-14A Split-then-Join Approach to Abstractive Summarization for Very Long Documents in a Low Resource Setting2025-05-11GASCADE: Grouped Summarization of Adverse Drug Event for Enhanced Cancer Pharmacovigilance2025-05-07