TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

SotA/Miscellaneous/Automated Theorem Proving/miniF2F-test

Automated Theorem Proving on miniF2F-test

Metric: Pass@1 (higher is better)

LeaderboardDataset
Loading chart...

Results

Submit a result
#Model↕Pass@1▼Extra DataPaperDate↕Code
1Kimina-Prover-Preview52.94YesKimina-Prover Preview: Towards Large Formal Reas...2025-04-15Code
2ProofAug36.5NoEfficient Neural Theorem Proving via Fine-graine...2025-01-30Code
3Thor + expert iteration on autoformalised theorems35.2Yes---
4COPRA + GPT-4-turbo30.7NoAn In-Context Learning Agent for Formal Theorem-...2023-10-06Code
5DeepSeek-Prover30YesDeepSeek-Prover: Advancing Theorem Proving in LL...2024-05-23-
6Thor29.9NoThor: Wielding Hammers to Integrate Language Mod...2022-05-22-
7Lean Expert Iteration29.6YesFormal Mathematics Statement Curriculum Learning2022-02-03Code
8MMOS-DeepSeekMath-7B28.3NoAn Empirical Study of Data Ability Boundary in L...2024-02-23Code
9Lean GPT-f24.6NoMiniF2F: a cross-system benchmark for formal Oly...2021-08-31Code
10PACT (reproduced by Thor)24.6NoProof Artifact Co-training for Theorem Proving w...2021-02-11Code
11COPRA + GPT-423.3NoAn In-Context Learning Agent for Formal Theorem-...2023-10-06Code
12Sledgehammer + heuristics20.9NoDraft, Sketch, and Prove: Guiding Formal Theorem...2022-10-21Code
13Lean tidy18NoMiniF2F: a cross-system benchmark for formal Oly...2021-08-31Code
14COPRA + GPT-3.511.9NoAn In-Context Learning Agent for Formal Theorem-...2023-10-06Code
15Sledgehammer10.4NoThor: Wielding Hammers to Integrate Language Mod...2022-05-22-
16Metamath GPT-f1.3NoMiniF2F: a cross-system benchmark for formal Oly...2021-08-31Code