Pythia 12B (5-shot)
Reported on 5 benchmarks across 3 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing5 results
- Accuracy· 2023-04-0376.7best: 90.1 (Unicorn 11B (fine-tuned))
- Accuracy· 2023-04-0366.6best: 96.1 (ST-MoE-32B 269B (fine-tuned))
- Accuracy· 2023-04-0336.8best: 96.4 (GPT-4 (few-shot, k=25))
- Accuracy· 2023-04-0371.5best: 95.2 (ST-MoE-32B 269B (fine-tuned))
- Accuracy· 2023-04-0336.5best: 100 (PaLM 540B (fine-tuned))