GSM-Plus
TextsIntroduced 2024-02-29
By perturbing the widely used GSM8K dataset, an adversarial dataset for grade-school math called GSM-Plus is created. Motivated by the capability taxonomy for solving math problems mentioned in Polya's principles, this paper identifies 5 perspectives to guide the development of GSM-Plus:
- Numerical Variation refers to altering the numerical data or its types, including 3 subcategories: Numerical Substitution, Digit Expansion, and Integer-decimal-fraction Conversion.
- Arithmetic Variation refers to reversing or introducing additional operations (e.g., addition, subtraction, multiplication, and division) to math problems, including 2 subcategories: Adding Operation and Reversing Operation.
- Problem Understanding refers to rephrasing the text description of the math problems.
- Distractor Insertion refers to inserting topic-related but useless sentences to the problems.
- Critical Thinking focuses on question or doubt ability when the question lacks necessary statements.
GSM-Plus can be used to evaluate the robustness of current LLMs in mathematical reasoning.