TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/Longformer

Longformer

Reported on 28 benchmarks across 12 tasks · 6 papers · 17 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing22 results

  • Binary text classificationonMAGE (Arbitrary-domains & Arbitrary-models)
    Average Recall· 2023-05-22
    0.9053
    best: 0.9611 (GigaCheck (Mistral-7B))
    SOTA
    MAGE: Machine-generated Text Detection in the WildarXiv:2305.13242
  • Question AnsweringonMuLD (NarrativeQA)
    BLEU-1· 2022-02-15
    19.84
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (NarrativeQA)
    BLEU-4· 2022-02-15
    62
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (NarrativeQA)
    METEOR· 2022-02-15
    4.52
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (NarrativeQA)
    Rouge-L· 2022-02-15
    22.09
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (HotpotQA)
    BLEU-1· 2022-02-15
    30.38
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (HotpotQA)
    BLEU-4· 2022-02-15
    16.76
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (HotpotQA)
    METEOR· 2022-02-15
    4.98
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Question AnsweringonMuLD (HotpotQA)
    Rouge-L· 2022-02-15
    30.49
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • SummarizationonMuLD (VLSP)
    BLEU-1· 2022-02-15
    46.74
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • SummarizationonMuLD (VLSP)
    METEOR· 2022-02-15
    9.58
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • SummarizationonMuLD (VLSP)
    Rouge-L· 2022-02-15
    19.52
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Text ClassificationonMuLD (Character Type)
    F1· 2022-02-15
    82.58
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • TranslationonMuLD (OpenSubtitles)
    BLEU-4· 2022-02-15
    20
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Text ClassificationonUK Key Stage Readability
    F1· 2024-11-26
    74
    best: 99.6 (ELECTRA + ANN)
    What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational LinguisticsarXiv:2411.17593
  • SummarizationonMuLD (VLSP)
    BLEU-4· 2022-02-15
    3.05
    best: 84 (T5)
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • TranslationonMuLD (OpenSubtitles)
    BLEU-1· 2022-02-15
    22.74
    best: 34.07 (T5)
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • TranslationonMuLD (OpenSubtitles)
    METEOR· 2022-02-15
    22.95
    best: 38.53 (T5)
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • TranslationonMuLD (OpenSubtitles)
    Rouge-L· 2022-02-15
    22.17
    best: 35.35 (T5)
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • Natural Language UnderstandingonLexGLUE
    CaseHOLD· 2021-10-03
    72
    best: 75.6 (CaseLaw-BERT)
    LexGLUE: A Benchmark Dataset for Legal Language Understanding in EnglisharXiv:2110.00976
  • Cross-LingualonReddit Ideological and Extreme Bias Dataset
    weighted-F1 score
    76.47
    best: 79.1 (SVM)
  • Cross-Lingual Document ClassificationonReddit Ideological and Extreme Bias Dataset
    weighted-F1 score
    76.47
    best: 79.1 (SVM)

Methodology4 results

  • ClassificationonMuLD (Character Type)
    F1· 2022-02-15
    82.58
    SOTA
    MuLD: The Multitask Long Document BenchmarkarXiv:2202.07362
  • ClassificationonUK Key Stage Readability
    F1· 2024-11-26
    74
    best: 99.6 (ELECTRA + ANN)
    What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational LinguisticsarXiv:2411.17593
  • Data MiningonIMDb Movie Reviews
    Accuracy· 2023-08-07
    95
    best: 95.6 (ELECTRA)
    Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion MiningarXiv:2308.03235
  • Interpretable Machine LearningonIMDb Movie Reviews
    Accuracy· 2023-08-07
    95
    best: 95.6 (ELECTRA)
    Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion MiningarXiv:2308.03235

Medical2 results

  • Language ModellingonMultiNews test
    Perplexity· 2021-01-02
    2.34
    best: 1.76 (CD-LM)
    SOTA
    CDLM: Cross-Document Language ModelingarXiv:2101.00406
  • Language ModellingonMultiNews val
    Perplexity· 2021-01-02
    2.03
    best: 1.69 (CD-LM)
    SOTA
    CDLM: Cross-Document Language ModelingarXiv:2101.00406