Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Models/T5-XXL 11B (fine-tuned)

T5-XXL 11B (fine-tuned)

Reported on 9 benchmarks across 4 tasks · 2 papers · 8 SOTA

Note: results are matched by exact model name. Different papers may use the same name for different model variants.

Natural Language Processing9 results

Question AnsweringonCOPA
Accuracy· 2019-10-23
94.8
best: 100 (PaLM 540B (finetuned) )
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Question AnsweringonMultiRC
F1· 2019-10-23
88.1
best: 90.1 (PaLM 540B (finetuned) )
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Question AnsweringonBoolQ
Accuracy· 2019-10-23
91.2
best: 99.87 (Mistral-Nemo 12B (HPT))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Common Sense ReasoningonReCoRD
EM· 2019-10-23
93.4
best: 95.9 (Turing NLR v5 XXL 5.4B (fine-tuned))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Natural Language InferenceonCommitmentBank
Accuracy· 2019-10-23
96.8
best: 100 (PaLM 540B (finetuned))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Natural Language InferenceonCommitmentBank
F1· 2019-10-23
93.9
best: 100 (PaLM 540B (finetuned))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Natural Language InferenceonMultiNLI
Matched· 2019-10-23
92
best: 92.6 (Turing NLR v5 XXL 5.4B (fine-tuned))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Coreference ResolutiononWinograd Schema Challenge
Accuracy· 2019-10-23
93.8
best: 100 (PaLM 540B (fine-tuned))
SOTA
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer arXiv:1910.10683
Common Sense ReasoningonCommonsenseQA
Accuracy· 2020-05-02
78.1
best: 92.54 (GPT-4o (HPT))
UnifiedQA: Crossing Format Boundaries With a Single QA System arXiv:2005.00700