ReProver
Reported on 4 benchmarks across 2 tasks
Note: results are matched by exact model name. Different papers may use the same name for different model variants.
Miscellaneous4 results
- 26.5best: 68.85 (Kimina-Prover-Preview)
- cumulative26.5best: 80.74 (Kimina-Prover-Preview)
- Pass@3226.5best: 68.85 (Kimina-Prover-Preview)
- cumulative26.5best: 80.74 (Kimina-Prover-Preview)