Prover Agent: An Agent-based Framework for Formal Mathematical Proofs
Kaito Baba, Chaoran Liu, Shuhei Kurita, Akiyoshi Sannai
Abstract
We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and feedback from Lean while also generating auxiliary lemmas to assist in discovering the overall proof strategy. It achieves an 86.1% success rate on the MiniF2F benchmark, establishing a new state-of-the-art among methods using small language models (SLMs) with a much lower sample budget than previous approaches. We also present case studies illustrating how these generated lemmas contribute to solving challenging problems.
Related Papers
Token Compression Meets Compact Vision Transformers: A Survey and Comparative Evaluation for Edge AI2025-07-13OpenAgentSafety: A Comprehensive Framework for Evaluating Real-World AI Agent Safety2025-07-08CriticLean: Critic-Guided Reinforcement Learning for Mathematical Formalization2025-07-08STELLA: Self-Evolving LLM Agent for Biomedical Research2025-07-01AI Agents-as-Judge: Automated Assessment of Accuracy, Consistency, Completeness and Clarity for Enterprise Documents2025-06-23Towards Advanced Mathematical Reasoning for LLMs via First-Order Logic Theorem Proving2025-06-20Exploring Big Five Personality and AI Capability Effects in LLM-Simulated Negotiation Dialogues2025-06-19xbench: Tracking Agents Productivity Scaling with Profession-Aligned Real-World Evaluations2025-06-16