TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Natural Language Processing/Question Answering/StoryCloze

Question Answering on StoryCloze

Metric: Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Accuracy▼Extra DataPaperDate↕Code
1BLOOMZ96.3NoCrosslingual Generalization through Multitask Fi...2022-11-03Code
2Flipped-3B95.88NoGuess the Instruction! Flipped Learning Makes La...2022-10-06Code
3FLAN 137B (few-shot, k=10)94.7NoFinetuned Language Models Are Zero-Shot Learners2021-09-03Code
4T0-3B (CoT fine-tuned)94.5NoThe CoT Collection: Improving Zero-shot and Few-...2023-05-23Code
5KiC-770M94.4NoKnowledge-in-Context: Towards Knowledgeable Semi...2022-10-28-
6FLAN 137B (zero-shot)93.4NoFinetuned Language Models Are Zero-Shot Learners2021-09-03Code
7Reading Strategies Model88.3NoImproving Machine Reading Comprehension with Gen...2018-10-31Code
8Finetuned Transformer LM86.5No--Code
9RoE-3B86.33NoExploring the Benefits of Training Expert Langua...2023-02-07Code
10OPT-175B79.82NoSparseGPT: Massive Language Models Can Be Accura...2023-01-02Code
11SparseGPT (175B, 50% Sparsity)78.87NoSparseGPT: Massive Language Models Can Be Accura...2023-01-02Code
12Memory chains and semantic supervision78.7No--Code
13Hidden Coherence Model77.6No---
14SparseGPT (175B, 4:8 Sparsity)77.02NoSparseGPT: Massive Language Models Can Be Accura...2023-01-02Code
15val-LS-skip76.5NoA Simple and Effective Approach to the Story Clo...2018-03-15-
16SparseGPT (175B, 2:4 Sparsity)76.19NoSparseGPT: Massive Language Models Can Be Accura...2023-01-02Code
17sMLP – deterministic 9.4B (0-shot)74.7NoEfficient Language Modeling with Sparse all-MLP2022-03-14-
18Switch Transformer 9B73.3NoEfficient Language Modeling with Sparse all-MLP2022-03-14-
19GPT-3 Large 760M (zero-shot)72.4NoLanguage Models are Few-Shot Learners2020-05-28Code
20Gshard 9B67.9NoEfficient Language Modeling with Sparse all-MLP2022-03-14-
21HASH Layers 10B (0-shot)64.7NoEfficient Language Modeling with Sparse all-MLP2022-03-14-
22Base Layers 10B (0-shot)61.4NoEfficient Language Modeling with Sparse all-MLP2022-03-14-
23OPT-175B (50% Sparsity)47.1NoSparseGPT: Massive Language Models Can Be Accura...2023-01-02Code