Auto Debugging on Big-bench Lite

Metric: Exact string match (higher is better)

LeaderboardDataset
Loading chart...
#ModelExact string matchExtra DataPaperDateCode
1PaLM 62B (few-shot, k=5)38.2NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code
2PaLM 540B (few-shot, k=5)38.2NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code
3PaLM 8B (few-shot, k=5)14.7NoPaLM: Scaling Language Modeling with Pathways2022-04-05Code