TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/OBQA

Question Answering on OBQA

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1FLAN 137B (zero-shot)78.4NoFinetuned Language Models Are Zero-Shot Learners2021-09-03Code
2FLAN 137B (few-shot, k=16)78.2NoFinetuned Language Models Are Zero-Shot Learners2021-09-03Code
3LLaMA 65B (zero-shot)60.2NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
4LLaMA 33B (zero-shot)58.6NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
5GPT-3 175B (zero-shot)57.6NoLanguage Models are Few-Shot Learners2020-05-28Code
6LLaMA 7B (zero-shot)57.2NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
7LLaMA 13B (zero-shot)56.4NoLLaMA: Open and Efficient Foundation Language Mo...2023-02-27Code
8PaLM 540B (zero-shot)53.4NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code
9PaLM 62B (zero-shot)50.4NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code