Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Longformer

Reported on 28 benchmarks across 12 tasks · 6 papers · 17 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.
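As a rough illustration of what exact-name matching implies, the sketch below groups result rows by their literal model-name string. The record layout and the function name `group_by_exact_name` are assumptions made for this example, not the site's actual implementation; the third row is an invented placeholder that shows a near-miss name staying separate.

```python
from collections import defaultdict

# Hypothetical records: (model name, benchmark, metric, value).
# The first two rows reuse values listed on this page; the third row is a
# made-up placeholder showing a name that exact matching will not merge.
results = [
    ("Longformer", "LexGLUE", "CaseHOLD", 72.0),
    ("Longformer", "MuLD (Character Type)", "F1", 82.58),
    ("Longformer-base", "Placeholder benchmark", "F1", 0.0),
]


def group_by_exact_name(rows):
    """Group rows by the literal model-name string, with no normalization."""
    grouped = defaultdict(list)
    for name, benchmark, metric, value in rows:
        grouped[name].append((benchmark, metric, value))
    return dict(grouped)


pages = group_by_exact_name(results)
for name in sorted(pages):
    print(name, len(pages[name]))
```

Under this scheme, two papers that both report a model called "Longformer" land on the same page even if they used different checkpoints, while a variant spelled "Longformer-base" gets a page of its own.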

Natural Language Processing · 22 results

Binary text classification on MAGE (Arbitrary-domains & Arbitrary-models)
Average Recall · 2023-05-22
0.9053
best: 0.9611 (GigaCheck (Mistral-7B))
SOTA
MAGE: Machine-generated Text Detection in the Wild arXiv:2305.13242
Question Answering on MuLD (NarrativeQA)
BLEU-1 · 2022-02-15
19.84
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (NarrativeQA)
BLEU-4 · 2022-02-15
62
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (NarrativeQA)
METEOR · 2022-02-15
4.52
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (NarrativeQA)
Rouge-L · 2022-02-15
22.09
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (HotpotQA)
BLEU-1 · 2022-02-15
30.38
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (HotpotQA)
BLEU-4 · 2022-02-15
16.76
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (HotpotQA)
METEOR · 2022-02-15
4.98
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Question Answering on MuLD (HotpotQA)
Rouge-L · 2022-02-15
30.49
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Summarization on MuLD (VLSP)
BLEU-1 · 2022-02-15
46.74
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Summarization on MuLD (VLSP)
METEOR · 2022-02-15
9.58
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Summarization on MuLD (VLSP)
Rouge-L · 2022-02-15
19.52
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Text Classification on MuLD (Character Type)
F1 · 2022-02-15
82.58
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Translation on MuLD (OpenSubtitles)
BLEU-4 · 2022-02-15
20
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Text Classification on UK Key Stage Readability
F1 · 2024-11-26
74
best: 99.6 (ELECTRA + ANN)
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics arXiv:2411.17593
Summarization on MuLD (VLSP)
BLEU-4 · 2022-02-15
3.05
best: 84 (T5)
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Translation on MuLD (OpenSubtitles)
BLEU-1 · 2022-02-15
22.74
best: 34.07 (T5)
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Translation on MuLD (OpenSubtitles)
METEOR · 2022-02-15
22.95
best: 38.53 (T5)
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Translation on MuLD (OpenSubtitles)
Rouge-L · 2022-02-15
22.17
best: 35.35 (T5)
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Natural Language Understanding on LexGLUE
CaseHOLD · 2021-10-03
72
best: 75.6 (CaseLaw-BERT)
LexGLUE: A Benchmark Dataset for Legal Language Understanding in English arXiv:2110.00976
Cross-Lingual on Reddit Ideological and Extreme Bias Dataset
weighted-F1 score
76.47
best: 79.1 (SVM)
Cross-Lingual Document Classification on Reddit Ideological and Extreme Bias Dataset
weighted-F1 score
76.47
best: 79.1 (SVM)

Methodology · 4 results

Classification on MuLD (Character Type)
F1 · 2022-02-15
82.58
SOTA
MuLD: The Multitask Long Document Benchmark arXiv:2202.07362
Classification on UK Key Stage Readability
F1 · 2024-11-26
74
best: 99.6 (ELECTRA + ANN)
What Differentiates Educational Literature? A Multimodal Fusion Approach of Transformers and Computational Linguistics arXiv:2411.17593
Data Mining on IMDb Movie Reviews
Accuracy · 2023-08-07
95
best: 95.6 (ELECTRA)
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining arXiv:2308.03235
Interpretable Machine Learning on IMDb Movie Reviews
Accuracy · 2023-08-07
95
best: 95.6 (ELECTRA)
Analysis of the Evolution of Advanced Transformer-Based Language Models: Experiments on Opinion Mining arXiv:2308.03235

Medical · 2 results

Language Modelling on MultiNews test
Perplexity · 2021-01-02
2.34
best: 1.76 (CD-LM)
SOTA
CDLM: Cross-Document Language Modeling arXiv:2101.00406
Language Modelling on MultiNews val
Perplexity · 2021-01-02
2.03
best: 1.69 (CD-LM)
SOTA
CDLM: Cross-Document Language Modeling arXiv:2101.00406