TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Natural Language Inference/ANLI test

Natural Language Inference on ANLI test

Metric: A1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕A1▼Extra DataPaperDate↕Code
1T5-3B (explanation prompting)81.8No---
2T0-11B (explanation prompting)75.6No---
3InfoBERT (RoBERTa)75YesInfoBERT: Improving Robustness of Language Model...2020-10-05Code
4PaLM 2-L (one-shot)73.1NoPaLM 2 Technical Report2023-05-17Code
5RoBERTa (Large)72.4YesRoBERTa: A Robustly Optimized BERT Pretraining A...2019-07-26Code
6ALUM (RoBERTa-LARGE)72.3YesAdversarial Training for Large Neural Language M...2020-04-20Code
7XLNet (Large)70.3YesXLNet: Generalized Autoregressive Pretraining fo...2019-06-19Code
8ChatGPT62.3NoA Systematic Study and Comprehensive Evaluation ...2023-05-29Code
9PaLM 2-M (one-shot)58.1NoPaLM 2 Technical Report2023-05-17Code
10PaLM 2-S (one-shot)53.1NoPaLM 2 Technical Report2023-05-17Code
11T0-3B (CoT fine-tuned)41.7NoThe CoT Collection: Improving Zero-shot and Few-...2023-05-23Code
12Flipped-3B39.99NoGuess the Instruction! Flipped Learning Makes La...2022-10-06Code
13GPT-336.8YesLanguage Models are Few-Shot Learners2020-05-28Code
14KiC-770M36.3NoKnowledge-in-Context: Towards Knowledgeable Semi...2022-10-28-
15RoE-3B35.49NoExploring the Benefits of Training Expert Langua...2023-02-07Code
16BLOOM 176B (one-shot)33.6NoBloombergGPT: A Large Language Model for Finance2023-03-30Code
17OPT 66B (one-shot)33.1NoBloombergGPT: A Large Language Model for Finance2023-03-30Code
18Bloomberg GPT (one-shot)32.9NoBloombergGPT: A Large Language Model for Finance2023-03-30Code
19GPT-NeoX (one-shot)32.6NoBloombergGPT: A Large Language Model for Finance2023-03-30Code