Mathematical Reasoning on SVAMP (1:N)
Metric: Execution Accuracy (higher is better)
Results
| # | Model | Execution Accuracy | Extra Data | Paper | Date | Code |
|---|---|---|---|---|---|---|
| 1 | ATHENA (roberta-large) | 67.8 | No | ATHENA: Mathematical Reasoning with Thought Expa... | 2023-11-02 | Code |
| 2 | ATHENA (roberta-base) | 52.5 | No | ATHENA: Mathematical Reasoning with Thought Expa... | 2023-11-02 | Code |
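
The page does not define how Execution Accuracy is computed. A minimal sketch under common assumptions: each model output is an arithmetic expression, the expression is executed, and the prediction counts as correct when its value matches the gold answer within a small tolerance. The score is assumed to be a percentage. The names `evaluate`, `execution_accuracy`, and `tol` are hypothetical, and the sample expressions are illustrative, not taken from SVAMP.

```python
import ast
import operator

# Binary operators allowed in a predicted expression (assumed set).
_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}


def evaluate(expr: str) -> float:
    """Evaluate a plain arithmetic expression without using eval()."""

    def _eval(node):
        if isinstance(node, ast.Expression):
            return _eval(node.body)
        if (
            isinstance(node, ast.Constant)
            and isinstance(node.value, (int, float))
            and not isinstance(node.value, bool)
        ):
            return float(node.value)
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError(f"unsupported expression: {expr!r}")

    return _eval(ast.parse(expr, mode="eval"))


def execution_accuracy(predictions, gold_answers, tol=1e-4):
    """Return the percentage of predictions whose executed value matches gold."""
    if not gold_answers or len(predictions) != len(gold_answers):
        raise ValueError("predictions and gold_answers must be non-empty and aligned")
    correct = 0
    for expr, gold in zip(predictions, gold_answers):
        try:
            value = evaluate(expr)
        except (ValueError, SyntaxError, ZeroDivisionError):
            continue  # expressions that fail to execute count as wrong
        if abs(value - gold) <= tol:
            correct += 1
    return 100.0 * correct / len(gold_answers)


# Illustrative data only: one correct, one wrong, one that fails to execute.
score = execution_accuracy(["(8 - 3) * 2", "7 + 1", "4 / 0"], [10, 9, 2])
print(round(score, 1))
```

Comparing executed values rather than expression strings means two different but equivalent expressions both count as correct, which is the usual motivation for an execution-based metric.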