Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning

Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang, Song-Chun Zhu

Date: 2021-05-10
Venue: ACL 2021
Tasks: Semantic Parsing, Question Answering, Mathematical Reasoning, Scene Parsing, Mathematical Question Answering, Visual Reasoning, Arithmetic Reasoning, Visual Question Answering (VQA)


Abstract

Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detection, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver, yielding a more efficient and reasonable search path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps.
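
The idea of encoding theorems as conditional rules and applying them in a predicted order can be illustrated with a toy sketch. This is not the Inter-GPS implementation: the fact names (`angle_A`, `AC`, ...), the two rules, and the `solve` function are all hypothetical, chosen only to show how a rule fires when its precondition holds and why the order of theorem application matters.

```python
import math
from typing import Callable, Dict, List, Optional, Tuple

# Known quantities of a single triangle ABC (hypothetical naming scheme).
Facts = Dict[str, float]


def angle_sum(facts: Facts) -> bool:
    """Conditional rule: if exactly one angle is unknown, derive it from 180 degrees."""
    angles = ["angle_A", "angle_B", "angle_C"]
    unknown = [a for a in angles if a not in facts]
    if len(unknown) != 1:
        return False  # precondition not met, rule does not fire
    facts[unknown[0]] = 180.0 - sum(facts[a] for a in angles if a in facts)
    return True


def pythagoras(facts: Facts) -> bool:
    """Conditional rule: if angle C is 90 and both legs are known, derive AB."""
    if facts.get("angle_C") != 90.0:
        return False
    if "AB" in facts or "AC" not in facts or "BC" not in facts:
        return False
    facts["AB"] = math.hypot(facts["AC"], facts["BC"])
    return True


THEOREMS: Dict[str, Callable[[Facts], bool]] = {
    "angle_sum": angle_sum,
    "pythagoras": pythagoras,
}


def solve(
    facts: Facts, target: str, theorem_order: List[str]
) -> Tuple[Optional[float], List[str]]:
    """Apply theorems in the given order; return the target value and fired rules."""
    facts = dict(facts)  # work on a copy so the caller's facts are untouched
    steps: List[str] = []
    for name in theorem_order:
        if target in facts:
            break
        if THEOREMS[name](facts):
            steps.append(name)
    return facts.get(target), steps


problem = {"angle_A": 45.0, "angle_B": 45.0, "AC": 1.0, "BC": 1.0}
answer, steps = solve(problem, "AB", ["angle_sum", "pythagoras"])
print(answer, steps)
```

In this sketch, the order `["angle_sum", "pythagoras"]` reaches the target in a single pass, whereas the reversed order does not, because the Pythagorean rule's precondition is not yet satisfied when it is tried. A fixed sequence standing in for a predictor's output is an assumption made here for brevity.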

Results

Task	Dataset	Metric	Value	Model
Question Answering	Geometry3K	Accuracy (%)	90.9	Human Expert
Question Answering	Geometry3K	Accuracy (%)	78.3	Inter-GPS (GT)
Question Answering	Geometry3K	Accuracy (%)	57.5	Inter-GPS
Question Answering	Geometry3K	Accuracy (%)	56.9	Human
Question Answering	Geometry3K	Accuracy (%)	25	Random
Question Answering	GeoS	Accuracy (%)	67	Inter-GPS
Scene Parsing	PGDP5K	Total Accuracy	27.3	Inter-GPS
Mathematical Question Answering	Geometry3K	Accuracy (%)	90.9	Human Expert
Mathematical Question Answering	Geometry3K	Accuracy (%)	78.3	Inter-GPS (GT)
Mathematical Question Answering	Geometry3K	Accuracy (%)	57.5	Inter-GPS
Mathematical Question Answering	Geometry3K	Accuracy (%)	56.9	Human
Mathematical Question Answering	Geometry3K	Accuracy (%)	25	Random
Mathematical Question Answering	GeoS	Accuracy (%)	67	Inter-GPS
2D Semantic Segmentation	PGDP5K	Total Accuracy	27.3	Inter-GPS
Mathematical Reasoning	PGPS9K	Completion accuracy	59.8	Inter-GPS
