Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Question Answering on MultiRC

Metric: F1 (higher is better)
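
The page does not say how the F1 figure is computed. As a minimal sketch, assuming each answer option is scored as a binary correct/incorrect label and F1 is taken over the positive class, the metric could be calculated as below. The function name `binary_f1` and the toy labels are hypothetical and are not taken from the leaderboard.

```python
def binary_f1(gold, pred):
    """F1 for the positive class (label 1) over paired label lists.

    Assumption: labels are 0/1, one per answer option. This is an
    illustrative sketch, not the leaderboard's official scorer.
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    tp = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 1)
    fp = sum(1 for g, p in zip(gold, pred) if g == 0 and p == 1)
    fn = sum(1 for g, p in zip(gold, pred) if g == 1 and p == 0)
    if tp == 0:
        # No true positives: precision or recall is zero or undefined.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Toy example (made-up labels): 2 true positives, 1 false positive,
# 1 false negative.
gold = [1, 0, 1, 1, 0, 0]
pred = [1, 0, 0, 1, 1, 0]
print(round(100 * binary_f1(gold, pred), 1))  # 66.7
```

Scores in the table are on a 0 to 100 scale, which is why the example multiplies by 100 before printing.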

Results

| # | Model | F1 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | PaLM 540B (finetuned) | 90.1 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 2 | ST-MoE-32B 269B (fine-tuned) | 89.6 | No | ST-MoE: Designing Stable and Transferable Sparse... | 2022-02-17 | Code |
| 3 | Turing NLR v5 XXL 5.4B (fine-tuned) | 88.4 | No | Toward Efficient Language Model Pretraining and ... | 2022-12-04 | - |
| 4 | DeBERTa-1.5B | 88.2 | No | DeBERTa: Decoding-enhanced BERT with Disentangle... | 2020-06-05 | Code |
| 5 | Vega v2 6B (fine-tuned) | 88.2 | No | Toward Efficient Language Model Pretraining and ... | 2022-12-04 | - |
| 6 | PaLM 2-L (one-shot) | 88.2 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 7 | T5-XXL 11B (fine-tuned) | 88.1 | No | Exploring the Limits of Transfer Learning with a... | 2019-10-23 | Code |
| 8 | ST-MoE-L 4.1B (fine-tuned) | 86 | No | ST-MoE: Designing Stable and Transferable Sparse... | 2022-02-17 | Code |
| 9 | PaLM 2-M (one-shot) | 84.1 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 10 | PaLM 2-S (one-shot) | 84 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 11 | FLAN 137B (prompt-tuned) | 83.4 | No | Finetuned Language Models Are Zero-Shot Learners | 2021-09-03 | Code |
| 12 | FLAN 137B (zero-shot) | 77.5 | No | Finetuned Language Models Are Zero-Shot Learners | 2021-09-03 | Code |
| 13 | GPT-3 175B (Few-Shot) | 75.4 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 14 | FLAN 137B (1-shot) | 72.1 | No | Finetuned Language Models Are Zero-Shot Learners | 2021-09-03 | Code |
| 15 | KELM (finetuning BERT-large based single model) | 70.8 | No | KELM: Knowledge Enhanced Pre-Trained Language Re... | 2021-09-09 | Code |
| 16 | BERT-large(single model) | 70 | No | BERT: Pre-training of Deep Bidirectional Transfo... | 2018-10-11 | Code |
| 17 | Neo-6B (QA + WS) | 63.8 | No | Ask Me Anything: A simple strategy for prompting... | 2022-10-05 | Code |
| 18 | Bloomberg GPT 50B (1-shot) | 62.3 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 19 | N-Grammer 343M | 62 | No | N-Grammer: Augmenting Transformers with latent n... | 2022-07-13 | Code |
| 20 | Neo-6B (few-shot) | 60.8 | No | Ask Me Anything: A simple strategy for prompting... | 2022-10-05 | Code |
| 21 | AlexaTM 20B | 59.6 | No | AlexaTM 20B: Few-Shot Learning Using a Large-Sca... | 2022-08-02 | Code |
| 22 | Neo-6B (QA) | 58.8 | No | Ask Me Anything: A simple strategy for prompting... | 2022-10-05 | Code |
| 23 | BLOOM 176B (1-shot) | 26.7 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 24 | GPT-NeoX 20B (1-shot) | 22.9 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 25 | OPT 66B (1-shot) | 18.8 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |