TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Knowledge Base/Text Summarization/MENSA

Text Summarization on MENSA

Metric: BERTScore (F1) (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕BERTScore (F1)▼Extra DataPaperDate↕Code
1NexusSum (Mistral Large)65.73NoNexusSum: Hierarchical LLM Agents for Long-Form ...2025-05-30-
2CachED (BART Large)64.6NoEnd-to-End Long Document Summarization using Gra...2025-01-03-
3Hierarchically Merging and Agent Refinement60.22NoAgent-as-Judge for Factual Summarization of Long...2025-01-17Code
4Unlimiformer (BART Base)58.7NoEnd-to-End Long Document Summarization using Gra...2025-01-03-
5SLED (BART Large)58.3NoEnd-to-End Long Document Summarization using Gra...2025-01-03-
6SELECT & SUMM (LED)57.46NoSelect and Summarize: Scene Saliency for Movie S...2024-04-04Code
7Two-Stage Heuristic (LED Large)56.34NoSelect and Summarize: Scene Saliency for Movie S...2024-04-04Code
8Zero-Shot (Mistral Large)54.8NoNexusSum: Hierarchical LLM Agents for Long-Form ...2025-05-30-
9Zero-Shot (GPT-4o)52.8NoEnd-to-End Long Document Summarization using Gra...2025-01-03-
10SUMM-N Multi Stage40.87NoSelect and Summarize: Scene Saliency for Movie S...2024-04-04Code