Prover Agent: An Agent-based Framework for Formal Mathematical Proofs

Kaito Baba, Chaoran Liu, Shuhei Kurita, Akiyoshi Sannai

2025-06-24AI Agent Automated Theorem Proving Mathematical Proofs

Abstract

We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and feedback from Lean while also generating auxiliary lemmas to assist in discovering the overall proof strategy. It achieves an 86.1% success rate on the MiniF2F benchmark, establishing a new state-of-the-art among methods using small language models (SLMs) with a much lower sample budget than previous approaches. We also present case studies illustrating how these generated lemmas contribute to solving challenging problems.

Related Papers

Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI2025-07-13 OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety2025-07-08 CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization2025-07-08 STELLA: Self-Evolving LLM Agent for Biomedical Research2025-07-01 AI Agents-as-Judge: Automated Assessment of Accuracy, Consistency, Completeness and Clarity for Enterprise Documents2025-06-23 Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving2025-06-20 Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues2025-06-19 xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations2025-06-16