TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Knowledge Base/Mathematical Reasoning/ASDiv-A

Mathematical Reasoning on ASDiv-A

Metric: Execution Accuracy (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Execution Accuracy▼Extra DataPaperDate↕Code
1ATHENA (roberta-large)91NoATHENA: Mathematical Reasoning with Thought Expa...2023-11-02Code
2MMOS-DeepSeekMath-7B(0-shot)87.6YesAn Empirical Study of Data Ability Boundary in L...2024-02-23Code
3ATHENA (roberta-base)86.4NoATHENA: Mathematical Reasoning with Thought Expa...2023-11-02Code
4MMOS-CODE-34B(0-shot)85.1YesAn Empirical Study of Data Ability Boundary in L...2024-02-23Code
5OpenMath-CodeLlama-70B (w/ code)84.7YesOpenMathInstruct-1: A 1.8 Million Math Instructi...2024-02-15Code
6Graph2Tree with RoBERTa82.2NoAre NLP Models really able to Solve Simple Math ...2021-03-12Code
7GTS with RoBERTa81.2NoAre NLP Models really able to Solve Simple Math ...2021-03-12Code
8MMOS-CODE-7B(0-shot)78.6YesAn Empirical Study of Data Ability Boundary in L...2024-02-23Code
9LSTM Seq2Seq with RoBERTa76.9NoAre NLP Models really able to Solve Simple Math ...2021-03-12Code