Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Reading Comprehension on RACE

Metric: Accuracy (High) (higher is better)


Results

| # | Model | Accuracy (High) | Extra Data | Paper | Date | Code |
|---|-------|-----------------|------------|-------|------|------|
| 1 | ALBERTxxlarge+DUMA(ensemble) | 92.6 | No | DUMA: Reading Comprehension with Transposition T... | 2020-01-26 | Code |
| 2 | Megatron-BERT (ensemble) | 90 | No | Megatron-LM: Training Multi-Billion Parameter La... | 2019-09-17 | Code |
| 3 | Megatron-BERT | 88.6 | No | Megatron-LM: Training Multi-Billion Parameter La... | 2019-09-17 | Code |
| 4 | B10-10-10 | 84.4 | No | Funnel-Transformer: Filtering out Sequential Red... | 2020-06-05 | Code |
| 5 | XLNet | 84 | No | XLNet: Generalized Autoregressive Pretraining fo... | 2019-06-19 | Code |
| 6 | RoBERTa | 81.3 | No | RoBERTa: A Robustly Optimized BERT Pretraining A... | 2019-07-26 | Code |
| 7 | LLaMA 65B (zero-shot) | 51.6 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 8 | PaLM 540B (zero-shot) | 49.1 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 9 | LLaMA 33B (zero-shot) | 48.3 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 10 | PaLM 62B (zero-shot) | 47.5 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 11 | LLaMA 13B (zero-shot) | 47.2 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 12 | LLaMA 7B (zero-shot) | 46.9 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 13 | GPT-3 175B (zero-shot) | 45.5 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 14 | PaLM 8B (zero-shot) | 42.3 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 15 | Bloomberg GPT (one-shot) | 41.74 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 16 | BLOOM 176B (one-shot) | 39.14 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 17 | OPT 66B (one-shot) | 37.02 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 18 | GPT-NeoX (one-shot) | 34.33 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |