Metric: Accuracy (higher is better)
| # | Model↕ | Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | Chinchilla-70B (few-shot, k=5) | 94.3 | No | Training Compute-Optimal Large Language Models | 2022-03-29 | Code |
| 2 | Gopher-280B (few-shot, k=5) | 93.9 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |
| 3 | Chinchilla-70B (few-shot, k=5) | 87 | No | Training Compute-Optimal Large Language Models | 2022-03-29 | Code |
| 4 | Gopher-280B (few-shot, k=5) | 81.8 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |
| 5 | Gopher-280B (few-shot, k=5) | 75.7 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |
| 6 | Gopher-280B (few-shot, k=64) | 57.1 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |
| 7 | Gopher-280B (few-shot, k=5) | 38 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |
| 8 | Gopher-280B (few-shot, k=64) | 28.2 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |