Mingzhe Wang, Yihe Tang, Jian Wang, Jia Deng
We propose a deep learning-based approach to the problem of premise selection: selecting mathematical statements relevant for proving a given conjecture. We represent a higher-order logic formula as a graph that is invariant to variable renaming but still fully preserves syntactic and semantic information. We then embed the graph into a vector via a novel embedding method that preserves the information of edge ordering. Our approach achieves state-of-the-art results on the HolStep dataset, improving the classification accuracy from 83% to 90.3%.
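The core idea of a renaming-invariant graph can be sketched in a few lines. The toy code below (not the paper's implementation) turns a nested-tuple term into a node list and an ordered edge list, relabeling bound variables with a generic `VAR` label so that alpha-equivalent formulas map to identical graphs; the term encoding and function name are illustrative assumptions.

```python
def term_to_graph(term, nodes=None, edges=None, bound=None):
    """Build (root_id, nodes, edges) from a nested-tuple term.

    Quantifiers are ('forall'|'exists', var, body). Bound-variable
    occurrences get the generic label 'VAR', so alpha-equivalent terms
    yield identical graphs. Edges are appended in argument order, a
    simplified stand-in for the paper's order-preserving embedding.
    """
    if nodes is None:
        nodes, edges, bound = [], [], {}
    if isinstance(term, str):                 # leaf: variable or constant
        nodes.append('VAR' if term in bound else term)
        return len(nodes) - 1, nodes, edges
    head, *args = term
    nodes.append(head)
    hid = len(nodes) - 1
    if head in ('forall', 'exists'):          # binder introduces a variable
        var, body = args
        bid, _, _ = term_to_graph(body, nodes, edges, {**bound, var: hid})
        edges.append((hid, bid))
        return hid, nodes, edges
    for arg in args:                          # function application: keep order
        aid, _, _ = term_to_graph(arg, nodes, edges, bound)
        edges.append((hid, aid))
    return hid, nodes, edges
```

For example, `('forall', 'x', ('f', 'x'))` and `('forall', 'y', ('f', 'y'))` produce the same node labels `['forall', 'f', 'VAR']` and the same edges, which is the invariance property the abstract refers to. The full method additionally merges all occurrences of a bound variable into one node; this sketch only relabels them.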
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Automated Theorem Proving | HolStep (Unconditional) | Classification Accuracy | 0.900 | FormulaNet |
| Automated Theorem Proving | HolStep (Unconditional) | Classification Accuracy | 0.890 | FormulaNet-basic |
| Automated Theorem Proving | HolStep (Conditional) | Classification Accuracy | 0.903 | FormulaNet |
| Automated Theorem Proving | HolStep (Conditional) | Classification Accuracy | 0.891 | FormulaNet-basic |