Pythia 12B (0-shot)
Reported on 6 benchmarks across 4 tasks · 1 paper
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Natural Language Processing · 5 results
- Accuracy · 2023-04-03 · 76 · best: 90.1 (Unicorn 11B (fine-tuned))
- Accuracy · 2023-04-03 · 63.9 · best: 96.1 (ST-MoE-32B 269B (fine-tuned))
- Accuracy · 2023-04-03 · 31.8 · best: 96.4 (GPT-4 (few-shot, k=25))
- Accuracy · 2023-04-03 · 70.2 · best: 95.2 (ST-MoE-32B 269B (fine-tuned))
- Accuracy · 2023-04-03 · 54.8 · best: 100 (PaLM 540B (fine-tuned))
Medical · 1 result
- Accuracy · 2023-04-03 · 70.46 · best: 89.7 (PaLM-540B (Few-Shot))