Text Summarization on MENSA
Metric: ROUGE-2 (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | ROUGE-2▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | NexusSum (Mistral Large) | 11.43 | No | NexusSum: Hierarchical LLM Agents for Long-Form ... | 2025-05-30 | - |
| 2 | Zero-Shot (Mistral Large) | 10.52 | No | NexusSum: Hierarchical LLM Agents for Long-Form ... | 2025-05-30 | - |
| 3 | Hierarchically Merging and Agent Refinement | 8.81 | No | Agent-as-Judge for Factual Summarization of Long... | 2025-01-17 | Code |