Metric: BERTScore (F1) (higher is better)
| # | Model↕ | BERTScore (F1)▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | NexusSum (Mistral Large) | 65.73 | No | NexusSum: Hierarchical LLM Agents for Long-Form ... | 2025-05-30 | - |
| 2 | CachED (BART Large) | 64.6 | No | End-to-End Long Document Summarization using Gra... | 2025-01-03 | - |
| 3 | Hierarchically Merging and Agent Refinement | 60.22 | No | Agent-as-Judge for Factual Summarization of Long... | 2025-01-17 | Code |
| 4 | Unlimiformer (BART Base) | 58.7 | No | End-to-End Long Document Summarization using Gra... | 2025-01-03 | - |
| 5 | SLED (BART Large) | 58.3 | No | End-to-End Long Document Summarization using Gra... | 2025-01-03 | - |
| 6 | SELECT & SUMM (LED) | 57.46 | No | Select and Summarize: Scene Saliency for Movie S... | 2024-04-04 | Code |
| 7 | Two-Stage Heuristic (LED Large) | 56.34 | No | Select and Summarize: Scene Saliency for Movie S... | 2024-04-04 | Code |
| 8 | Zero-Shot (Mistral Large) | 54.8 | No | NexusSum: Hierarchical LLM Agents for Long-Form ... | 2025-05-30 | - |
| 9 | Zero-Shot (GPT-4o) | 52.8 | No | End-to-End Long Document Summarization using Gra... | 2025-01-03 | - |
| 10 | SUMM-N Multi Stage | 40.87 | No | Select and Summarize: Scene Saliency for Movie S... | 2024-04-04 | Code |