Mathematical Reasoning on GSM-Plus
Metric: 1:1 Accuracy (higher is better)
LeaderboardDataset
Loading chart...
Results
Submit a result| # | Model↕ | 1:1 Accuracy▼ | Extra Data | Paper | Date↕ | Code |
|---|---|---|---|---|---|---|
| 1 | GPT-4 | 85.6 | No | GSM-Plus: A Comprehensive Benchmark for Evaluati... | 2024-02-29 | Code |