Question Answering on MultiRC

Metric: EM (higher is better)

LeaderboardDataset

Loading chart...

Results

Sort:

#	Model↕	EM▼	Extra Data	Paper	Date↕	Code
1	PaLM 540B (finetuned)	69.2	No	PaLM: Scaling Language Modeling with Pathways	2022-04-05	Code
2	DeBERTa-1.5B	63.7	No	DeBERTa: Decoding-enhanced BERT with Disentangle...	2020-06-05	Code
3	T5-11B	63.3	No	Exploring the Limits of Transfer Learning with a...	2019-10-23	Code
4	Turing NLR v5 XXL 5.4B (fine-tuned)	63	No	Toward Efficient Language Model Pretraining and ...	2022-12-04	-
5	Vega v2 6B (fine-tuned)	62.4	No	Toward Efficient Language Model Pretraining and ...	2022-12-04	-
6	Hybrid H3 355M (3-shot, logit scoring)	59.7	No	Hungry Hungry Hippos: Towards Language Modeling ...	2022-12-28	Code
7	Hybrid H3 355M (0-shot, logit scoring)	59.5	No	Hungry Hungry Hippos: Towards Language Modeling ...	2022-12-28	Code
8	Hybrid H3 125M (0-shot, logit scoring)	51.4	No	Hungry Hungry Hippos: Towards Language Modeling ...	2022-12-28	Code
9	Hybrid H3 125M (3-shot, logit scoring)	48.9	No	Hungry Hungry Hippos: Towards Language Modeling ...	2022-12-28	Code
10	KELM (finetuning BERT-large based single model)	27.2	No	KELM: Knowledge Enhanced Pre-Trained Language Re...	2021-09-09	Code
11	BERT-large(single model)	24.1	No	BERT: Pre-training of Deep Bidirectional Transfo...	2018-10-11	Code
12	N-Grammer 343M	11.3	No	N-Grammer: Augmenting Transformers with latent n...	2022-07-13	Code

#1PaLM 540B (finetuned) SOTA
69.2
EM· 2022-04-05
PaLM: Scaling Language Modeling with Pathways Code
#2DeBERTa-1.5BSOTA
63.7
EM· 2020-06-05
DeBERTa: Decoding-enhanced BERT with Disentangled Attention Code
#3T5-11BSOTA
63.3
EM· 2019-10-23
Exploring the Limits of Transfer Learning with a Unified Text-to-Text Transformer Code
#4Turing NLR v5 XXL 5.4B (fine-tuned)
63
EM· 2022-12-04
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
#5Vega v2 6B (fine-tuned)
62.4
EM· 2022-12-04
Toward Efficient Language Model Pretraining and Downstream Adaptation via Self-Evolution: A Case Study on SuperGLUE
#6Hybrid H3 355M (3-shot, logit scoring)
59.7
EM· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models Code
#7Hybrid H3 355M (0-shot, logit scoring)
59.5
EM· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models Code
#8Hybrid H3 125M (0-shot, logit scoring)
51.4
EM· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models Code
#9Hybrid H3 125M (3-shot, logit scoring)
48.9
EM· 2022-12-28
Hungry Hungry Hippos: Towards Language Modeling with State Space Models Code
#10KELM (finetuning BERT-large based single model)
27.2
EM· 2021-09-09
KELM: Knowledge Enhanced Pre-Trained Language Representations with Message Passing on Hierarchical Relational Graphs Code
#11BERT-large(single model)SOTA
24.1
EM· 2018-10-11
BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding Code
#12N-Grammer 343M
11.3
EM· 2022-07-13
N-Grammer: Augmenting Transformers with latent n-grams Code