MyGO Multiplex CoT: A Method for Self-Reflection in Large Language Models via Double Chain of Thought Thinking

Shihao Ji, Zihui Song, Fucheng Zhong, Jisen Jia, Zhaobo Wu, Zheyi Cao, Tianhao Xu

2025-01-20LLM real-life tasks Prompt Engineering Decision Making GSM8K MMLU HumanEval

Abstract

Recent advancements in large language models (LLMs) have demonstrated their impressive abilities in various reasoning and decision-making tasks. However, the quality and coherence of the reasoning process can still benefit from enhanced introspection and self-reflection. In this paper, we introduce Multiplex CoT (Chain of Thought), a method that enables LLMs to simulate a form of self-review while reasoning, by initiating double Chain of Thought (CoT) thinking. Multiplex CoT leverages the power of iterative reasoning, where the model generates an initial chain of thought and subsequently critiques and refines this reasoning with a second round of thought generation. This recursive approach allows for more coherent, logical, and robust answers, improving the overall decision-making process. We demonstrate how this method can be effectively implemented using simple prompt engineering in existing LLM architectures, achieving an effect similar to that of the Learning-Refinement Model (LRM) without the need for additional training. Additionally, we present a practical guide for implementing the method in Google Colab, enabling easy integration into real-world applications.

Results

Task	Dataset	Metric	Value	Model
GSM8K	GSM8K	0-shot MRR	98	Orange-mini
MMLU	MMLU-Pro	0-shot MRR	99.19	Orange-mini

Related Papers

Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion2025-07-18 Leveraging Language Prior for Infrared Small Target Detection2025-07-17 Emotional Support with LLM-based Empathetic Dialogue Generation2025-07-17 Higher-Order Pattern Unification Modulo Similarity Relations2025-07-17 Exploiting Constraint Reasoning to Build Graphical Explanations for Mixed-Integer Linear Programming2025-07-17 GEMMAS: Graph-based Evaluation Metrics for Multi Agent Systems2025-07-17 DAC: A Dynamic Attention-aware Approach for Task-Agnostic Prompt Compression2025-07-16 Learning What Matters: Probabilistic Task Selection via Mutual Information for Model Finetuning2025-07-16