Natural Language Inference on ANLI test
Metric: A1 (higher is better)
Results
| # | Model | A1 | Extra Data | Paper | Date | Code |
|---|-------|----|------------|-------|------|------|
| 1 | T5-3B (explanation prompting) | 81.8 | No | - | - | - |
| 2 | T0-11B (explanation prompting) | 75.6 | No | - | - | - |
| 3 | InfoBERT (RoBERTa) | 75 | Yes | InfoBERT: Improving Robustness of Language Model... | 2020-10-05 | Code |
| 4 | PaLM 2-L (one-shot) | 73.1 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 5 | RoBERTa (Large) | 72.4 | Yes | RoBERTa: A Robustly Optimized BERT Pretraining A... | 2019-07-26 | Code |
| 6 | ALUM (RoBERTa-LARGE) | 72.3 | Yes | Adversarial Training for Large Neural Language M... | 2020-04-20 | Code |
| 7 | XLNet (Large) | 70.3 | Yes | XLNet: Generalized Autoregressive Pretraining fo... | 2019-06-19 | Code |
| 8 | ChatGPT | 62.3 | No | A Systematic Study and Comprehensive Evaluation ... | 2023-05-29 | Code |
| 9 | PaLM 2-M (one-shot) | 58.1 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 10 | PaLM 2-S (one-shot) | 53.1 | No | PaLM 2 Technical Report | 2023-05-17 | Code |
| 11 | T0-3B (CoT fine-tuned) | 41.7 | No | The CoT Collection: Improving Zero-shot and Few-... | 2023-05-23 | Code |
| 12 | Flipped-3B | 39.99 | No | Guess the Instruction! Flipped Learning Makes La... | 2022-10-06 | Code |
| 13 | GPT-3 | 36.8 | Yes | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 14 | KiC-770M | 36.3 | No | Knowledge-in-Context: Towards Knowledgeable Semi... | 2022-10-28 | - |
| 15 | RoE-3B | 35.49 | No | Exploring the Benefits of Training Expert Langua... | 2023-02-07 | Code |
| 16 | BLOOM 176B (one-shot) | 33.6 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 17 | OPT 66B (one-shot) | 33.1 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 18 | Bloomberg GPT (one-shot) | 32.9 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 19 | GPT-NeoX (one-shot) | 32.6 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |