TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Knowledge Base/Mathematical Reasoning

Mathematical Reasoning

29 benchmarks805 papers

Benchmarks

Mathematical Reasoning on MATH

AccuracyParameters (Billions)

Mathematical Reasoning on MAWPS

Accuracy (%)

Mathematical Reasoning on SVAMP

Execution AccuracyAccuracy

Mathematical Reasoning on Math23K

Accuracy (5-fold)Accuracy (training-test)weakly-supervised

Mathematical Reasoning on ALG514

Accuracy (%)

Mathematical Reasoning on AIME24

Acc

Mathematical Reasoning on ASDiv-A

Execution Accuracy

Mathematical Reasoning on FrontierMath

Accuracy

Mathematical Reasoning on Lila (IID)

Accuracy

Mathematical Reasoning on Lila (OOD)

Accuracy

Mathematical Reasoning on PGPS9K

Completion accuracy

Mathematical Reasoning on ParaMAWPS

Accuracy (%)

Mathematical Reasoning on DRAW-1K

Accuracy (%)

Mathematical Reasoning on MathQA

Answer Accuracy

Mathematical Reasoning on AMC23

Acc

Mathematical Reasoning on BIG-bench

AccuracyAccuracy

Mathematical Reasoning on GeoQA

Accuracy (%)

Mathematical Reasoning on SVAMP (1:N)

Execution Accuracy

Mathematical Reasoning on GSM-Plus

1:1 Accuracy

Mathematical Reasoning on MATH minival

Accuracy

Mathematical Reasoning on MATH500

Acc

Mathematical Reasoning on PEN

Accuracy (%)

Mathematical Reasoning on UniGeo

Accuracy (%)

Mathematical Reasoning on UniGeo (PRV)

Accuracy (%)