TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Zero-Shot (Mistral Large)

Zero-Shot (Mistral Large)

Reported on 16 benchmarks across 1 task · 1 paper · 4 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Knowledge Base16 results

  • Text SummarizationonSummScreen
    ROUGE-2· 2025-05-30
    7.43
    SOTA
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonSummScreen
    ROUGE-L· 2025-05-30
    19.06
    SOTA
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMENSA
    ROUGE-L· 2025-05-30
    21.52
    SOTA
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMovieSum
    ROUGE-L· 2025-05-30
    22.55
    SOTA
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonBookSum
    BERTScore (F1)· 2025-05-30
    46.42
    best: 70.7 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonBookSum
    ROUGE-1· 2025-05-30
    19.63
    best: 42.51 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonBookSum
    ROUGE-2· 2025-05-30
    2.99
    best: 10.53 (Echoes-Extractive-Abstractive)
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonBookSum
    ROUGE-L· 2025-05-30
    12
    best: 23.91 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonSummScreen
    BERTScore (F1)· 2025-05-30
    57.23
    best: 61.59 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonSummScreen
    ROUGE-1· 2025-05-30
    29.18
    best: 30.44 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMENSA
    BERTScore (F1)· 2025-05-30
    54.8
    best: 65.73 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMENSA
    ROUGE-1· 2025-05-30
    37.43
    best: 44.91 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMENSA
    ROUGE-2· 2025-05-30
    10.52
    best: 11.43 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMovieSum
    BERTScore (F1)· 2025-05-30
    55.5
    best: 63.53 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMovieSum
    ROUGE-1· 2025-05-30
    39.22
    best: 44.91 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575
  • Text SummarizationonMovieSum
    ROUGE-2· 2025-05-30
    10.53
    best: 11.43 (NexusSum (Mistral Large))
    NexusSum: Hierarchical LLM Agents for Long-Form Narrative SummarizationarXiv:2505.24575