ASyMOB

ASyMOB: Algebraic Symbolic Mathematical Operations Benchmark

TextsCC BY-SA 4.0Introduced 2025-04-30

ASyMOB (pronounced Asimov, in tribute to the renowned author), is a novel assessment framework focused exclusively on symbolic manipulation, featuring 17,092 unique math challenges, organized by similarity and complexity. ASyMOB enables analysis of LLM failure root-causes and generalization capabilities by comparing performance in problems that differ by simple numerical or symbolic "perturbations".