Common Sense Reasoning on BIG-bench (Known Unknowns)
Metric: Accuracy (higher is better)
Results
| # | Model | Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | PaLM-540B (few-shot, k=5) | 73.9 | No | PaLM: Scaling Language Modeling with Pathways | 2022-04-05 | Code |
| 2 | Chinchilla-70B (few-shot, k=5) | 65.2 | No | Training Compute-Optimal Large Language Models | 2022-03-29 | Code |
| 3 | Gopher-280B (few-shot, k=5) | 63.6 | No | Scaling Language Models: Methods, Analysis & Ins... | 2021-12-08 | Code |