Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Reading Comprehension on RACE

Metric: Accuracy (Middle) (higher is better)
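
As a minimal sketch of how an accuracy figure like the ones on this leaderboard is typically computed for a multiple-choice benchmark: the percentage of questions where the predicted option matches the gold option. The function name `accuracy_percent` and the toy answers below are hypothetical, chosen for illustration only; they are not taken from RACE or from any listed paper's evaluation code.

```python
def accuracy_percent(predictions, gold):
    """Return the percentage of questions whose predicted option equals the gold option."""
    if len(predictions) != len(gold):
        raise ValueError("predictions and gold must have the same length")
    if not gold:
        raise ValueError("need at least one question")
    # Count exact matches between predicted and gold option labels.
    correct = sum(p == g for p, g in zip(predictions, gold))
    return 100.0 * correct / len(gold)


# Hypothetical toy data: five four-option questions (labels A-D).
gold = ["A", "C", "B", "D", "A"]
predictions = ["A", "C", "D", "D", "B"]

# 3 of the 5 predictions match the gold labels.
print(accuracy_percent(predictions, gold))
```

Individual papers may differ in prompt format, answer scoring, or the evaluation split used, so scores from different setups (fine-tuned, zero-shot, one-shot) are not strictly comparable.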

Results

| # | Model | Accuracy (Middle) | Extra Data | Paper | Date | Code |
|---|-------|-------------------|------------|-------|------|------|
| 1 | Megatron-BERT (ensemble) | 93.1 | No | Megatron-LM: Training Multi-Billion Parameter La... | 2019-09-17 | Code |
| 2 | Megatron-BERT | 91.8 | No | Megatron-LM: Training Multi-Billion Parameter La... | 2019-09-17 | Code |
| 3 | B10-10-10 | 88.8 | No | Funnel-Transformer: Filtering out Sequential Red... | 2020-06-05 | Code |
| 4 | ALBERTxxlarge+DUMA(ensemble) | 88.7 | No | DUMA: Reading Comprehension with Transposition T... | 2020-01-26 | Code |
| 5 | XLNet | 88.6 | No | XLNet: Generalized Autoregressive Pretraining fo... | 2019-06-19 | Code |
| 6 | RoBERTa | 86.5 | No | RoBERTa: A Robustly Optimized BERT Pretraining A... | 2019-07-26 | Code |
| 7 | PaLM 540B (zero-shot) | 68.1 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 8 | LLaMA 65B (zero-shot) | 67.9 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 9 | PaLM 62B (zero-shot) | 64.3 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 10 | LLaMA 33B (zero-shot) | 64.1 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 11 | LLaMA 13B (zero-shot) | 61.6 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 12 | LLaMA 7B (zero-shot) | 61.1 | No | LLaMA: Open and Efficient Foundation Language Mo... | 2023-02-27 | Code |
| 13 | GPT-3 175B (0-shot) | 58.4 | No | Language Models are Few-Shot Learners | 2020-05-28 | Code |
| 14 | PaLM 8B (zero-shot) | 57.9 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 15 | Bloomberg GPT (one-shot) | 54.32 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 16 | BLOOM 176B (one-shot) | 52.3 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 17 | OPT 66B (one-shot) | 47.42 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |
| 18 | GPT-NeoX (one-shot) | 41.23 | No | BloombergGPT: A Large Language Model for Finance | 2023-03-30 | Code |