Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Teaching-Inspired Integrated Prompting Framework: A Novel Approach for Enhancing Reasoning in Large Language Models

Wenting Tan, Dongxiao Chen, Jieting Xue, ZiHao Wang, Taijie Chen

2024-10-10 · Mathematical Reasoning · Math · Math Word Problem Solving · Arithmetic Reasoning
Paper · PDF · Code (official)

Abstract

Large Language Models (LLMs) exhibit impressive performance across various domains but still struggle with arithmetic reasoning tasks. Recent work shows the effectiveness of prompt design methods in enhancing reasoning capabilities. However, these approaches overlook the prior knowledge of specific concepts, theorems, and tricks required to solve most arithmetic reasoning problems successfully. To address this issue, we propose a novel and effective Teaching-Inspired Integrated Framework, which emulates the instructional process of a teacher guiding students. This method equips LLMs with essential concepts, relevant theorems, and similar problems with analogous solution approaches, thereby enhancing their reasoning abilities. Additionally, we introduce two new Chinese datasets, MathMC and MathToF, both with detailed explanations and answers. Experiments on nine benchmarks demonstrate that our approach improves the reasoning accuracy of LLMs. With GPT-4 and our framework, we achieve new state-of-the-art performance on four math benchmarks (AddSub, SVAMP, Math23K and AQuA) with accuracies of 98.2% (+3.3%), 93.9% (+0.2%), 94.3% (+7.2%) and 81.1% (+1.2%). Our data and code are available at https://github.com/SallyTan13/Teaching-Inspired-Prompting.
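The abstract describes priming the model with essential concepts, relevant theorems, and similar solved problems before posing the target question. A minimal sketch of that prompt-assembly idea, in Python, might look like the following; the function name, argument structure, and example data are illustrative assumptions, not the authors' actual implementation (see their repository for the real code):

```python
def build_teaching_prompt(question, concepts, theorems, similar_problems):
    """Assemble a prompt that 'teaches' (concepts, theorems, worked
    examples) before asking the model to solve the target question."""
    parts = ["You are a math teacher guiding a student.", ""]

    parts.append("Essential concepts:")
    parts.extend(f"- {c}" for c in concepts)
    parts.append("")

    parts.append("Relevant theorems:")
    parts.extend(f"- {t}" for t in theorems)
    parts.append("")

    parts.append("Similar solved problems:")
    for i, (problem, solution) in enumerate(similar_problems, 1):
        parts.append(f"Example {i}: {problem}")
        parts.append(f"Solution: {solution}")
    parts.append("")

    parts.append(f"Now solve step by step: {question}")
    return "\n".join(parts)


prompt = build_teaching_prompt(
    question="A train travels 120 km in 2 hours. What is its average speed?",
    concepts=["Average speed = total distance / total time"],
    theorems=["Unit consistency: distance in km over time in hours gives km/h"],
    similar_problems=[
        ("A car covers 60 km in 1.5 hours; find its speed.",
         "speed = 60 / 1.5 = 40 km/h"),
    ],
)
```

The resulting string would then be sent as a single user message to the LLM; in the paper's setup the concepts, theorems, and similar problems are presumably retrieved per question rather than hand-written as above.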

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Question Answering | Math23K | Accuracy (5-fold) | 94.3 | GPT-4 (Teaching-Inspired) |
| Question Answering | SVAMP | Execution Accuracy | 93.9 | GPT-4 (Teaching-Inspired) |
| Math Word Problem Solving | Math23K | Accuracy (5-fold) | 94.3 | GPT-4 (Teaching-Inspired) |
| Math Word Problem Solving | SVAMP | Execution Accuracy | 93.9 | GPT-4 (Teaching-Inspired) |
| Mathematical Question Answering | Math23K | Accuracy (5-fold) | 94.3 | GPT-4 (Teaching-Inspired) |
| Mathematical Question Answering | SVAMP | Execution Accuracy | 93.9 | GPT-4 (Teaching-Inspired) |
| Mathematical Reasoning | Math23K | Accuracy (5-fold) | 94.3 | GPT-4 (Teaching-Inspired) |
| Mathematical Reasoning | SVAMP | Execution Accuracy | 93.9 | GPT-4 (Teaching-Inspired) |
| Arithmetic Reasoning | MathToF | Accuracy | 89.2 | GPT-4 (Teaching-Inspired) |
| Arithmetic Reasoning | MathMC | Accuracy | 92.2 | GPT-4 (Teaching-Inspired) |
| Arithmetic Reasoning | GSM8K | Accuracy | 94.8 | GPT-4 (Teaching-Inspired) |

Related Papers

- VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks (2025-07-17)
- QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation (2025-07-17)
- A Survey of Deep Learning for Geometry Problem Solving (2025-07-16)
- Scaling Up RL: Unlocking Diverse Reasoning in LLMs via Prolonged Training (2025-07-16)
- KisMATH: Do LLMs Have Knowledge of Implicit Structures in Mathematical Reasoning? (2025-07-15)
- Temperature and Persona Shape LLM Agent Consensus With Minimal Accuracy Gains in Qualitative Coding (2025-07-15)
- Personalized Exercise Recommendation with Semantically-Grounded Knowledge Tracing (2025-07-15)
- DCR: Quantifying Data Contamination in LLMs Evaluation (2025-07-15)