Tasks
SotA
Datasets
Papers
Methods
Submit
About
SotA
/
Natural Language Processing
/
Question Answering
/
MultiRC
Question Answering on MultiRC
Metric: EM (higher is better)
Leaderboard
Dataset
Loading chart...
Results
Submit a result
Export CSV
Sort:
EM (best first)
EM (worst first)
Date (newest first)
Date (oldest first)
Model name (A→Z)
#
Model
↕
EM
▼
Extra Data
Paper
Date
↕
Code
1
PaLM 540B (finetuned)
69.2
No
PaLM: Scaling Language Modeling with Pathways
2022-04-05
Code
2
DeBERTa-1.5B
63.7
No
DeBERTa: Decoding-enhanced BERT with Disentangle...
2020-06-05
Code
3
T5-11B
63.3
No
Exploring the Limits of Transfer Learning with a...
2019-10-23
Code
4
Turing NLR v5 XXL 5.4B (fine-tuned)
63
No
Toward Efficient Language Model Pretraining and ...
2022-12-04
-
5
Vega v2 6B (fine-tuned)
62.4
No
Toward Efficient Language Model Pretraining and ...
2022-12-04
-
6
Hybrid H3 355M (3-shot, logit scoring)
59.7
No
Hungry Hungry Hippos: Towards Language Modeling ...
2022-12-28
Code
7
Hybrid H3 355M (0-shot, logit scoring)
59.5
No
Hungry Hungry Hippos: Towards Language Modeling ...
2022-12-28
Code
8
Hybrid H3 125M (0-shot, logit scoring)
51.4
No
Hungry Hungry Hippos: Towards Language Modeling ...
2022-12-28
Code
9
Hybrid H3 125M (3-shot, logit scoring)
48.9
No
Hungry Hungry Hippos: Towards Language Modeling ...
2022-12-28
Code
10
KELM (finetuning BERT-large based single model)
27.2
No
KELM: Knowledge Enhanced Pre-Trained Language Re...
2021-09-09
Code
11
BERT-large(single model)
24.1
No
BERT: Pre-training of Deep Bidirectional Transfo...
2018-10-11
Code
12
N-Grammer 343M
11.3
No
N-Grammer: Augmenting Transformers with latent n...
2022-07-13
Code
#1
PaLM 540B (finetuned)
SOTA
69.2
EM
· 2022-04-05
PaLM: Scaling Language Modeling with Pathways
Code
#2
DeBERTa-1.5B
SOTA
63.7
EM
· 2020-06-05
DeBERTa: Decoding-enhanced BERT with Disentangled Attention
Code
#3
T5-11B
SOTA
63.3
EM
· 2019-10-23
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer
Code
#4
Turing NLR v5 XXL 5.4B (fine-tuned)
63
EM
· 2022-12-04
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
#5
Vega v2 6B (fine-tuned)
62.4
EM
· 2022-12-04
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
#6
Hybrid H3 355M (3-shot, logit scoring)
59.7
EM
· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Code
#7
Hybrid H3 355M (0-shot, logit scoring)
59.5
EM
· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Code
#8
Hybrid H3 125M (0-shot, logit scoring)
51.4
EM
· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Code
#9
Hybrid H3 125M (3-shot, logit scoring)
48.9
EM
· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models
Code
#10
KELM (finetuning BERT-large based single model)
27.2
EM
· 2021-09-09
KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs
Code
#11
BERT-large(single model)
SOTA
24.1
EM
· 2018-10-11
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding
Code
#12
N-Grammer 343M
11.3
EM
· 2022-07-13
N-Grammer: Augmenting Transformers with latent n-grams
Code