TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Inter-GPS: Interpretable Geometry Problem Solving with For...

Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning

Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang, Song-Chun Zhu

2021-05-10ACL 2021 5Semantic ParsingQuestion AnsweringMathematical ReasoningScene ParsingMathematical Question AnsweringVisual ReasoningArithmetic ReasoningVisual Question Answering (VQA)
PaperPDFCode(official)

Abstract

Geometry problem solving has attracted much attention in the NLP community recently. The task is challenging as it requires abstract problem understanding and symbolic reasoning with axiomatic knowledge. However, current datasets are either small in scale or not publicly available. Thus, we construct a new large-scale benchmark, Geometry3K, consisting of 3,002 geometry problems with dense annotation in formal language. We further propose a novel geometry solving approach with formal language and symbolic reasoning, called Interpretable Geometry Problem Solver (Inter-GPS). Inter-GPS first parses the problem text and diagram into formal language automatically via rule-based text parsing and neural object detecting, respectively. Unlike implicit learning in existing methods, Inter-GPS incorporates theorem knowledge as conditional rules and performs symbolic reasoning step by step. Also, a theorem predictor is designed to infer the theorem application sequence fed to the symbolic solver for the more efficient and reasonable searching path. Extensive experiments on the Geometry3K and GEOS datasets demonstrate that Inter-GPS achieves significant improvements over existing methods. The project with code and data is available at https://lupantech.github.io/inter-gps.

Results

TaskDatasetMetricValueModel
Question AnsweringGeometry3KAccuracy (%)90.9Human Expert
Question AnsweringGeometry3KAccuracy (%)78.3Inter-GPS (GT)
Question AnsweringGeometry3KAccuracy (%)57.5Inter-GPS
Question AnsweringGeometry3KAccuracy (%)56.9Human
Question AnsweringGeometry3KAccuracy (%)25Random
Question AnsweringGeoSAccuracy (%)67Inter-GPS
Scene ParsingPGDP5KTotal Accuracy27.3Inter-GPS
Mathematical Question AnsweringGeometry3KAccuracy (%)90.9Human Expert
Mathematical Question AnsweringGeometry3KAccuracy (%)78.3Inter-GPS (GT)
Mathematical Question AnsweringGeometry3KAccuracy (%)57.5Inter-GPS
Mathematical Question AnsweringGeometry3KAccuracy (%)56.9Human
Mathematical Question AnsweringGeometry3KAccuracy (%)25Random
Mathematical Question AnsweringGeoSAccuracy (%)67Inter-GPS
2D Semantic SegmentationPGDP5KTotal Accuracy27.3Inter-GPS
Mathematical ReasoningPGPS9KCompletion accuracy59.8Inter-GPS

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16