TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Think before You Simulate: Symbolic Reasoning to Orchestra...

Think before You Simulate: Symbolic Reasoning to Orchestrate Neural Computation for Counterfactual Question Answering

Adam Ishay, Zhun Yang, Joohyung Lee, Ilgu Kang, Dongjae Lim

2025-06-12IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2024 1Question AnsweringCounterfactual Reasoning
PaperPDFCode

Abstract

Causal and temporal reasoning about video dynamics is a challenging problem. While neuro-symbolic models that combine symbolic reasoning with neural-based perception and prediction have shown promise, they exhibit limitations, especially in answering counterfactual questions. This paper introduces a method to enhance a neuro-symbolic model for counterfactual reasoning, leveraging symbolic reasoning about causal relations among events. We define the notion of a causal graph to represent such relations and use Answer Set Programming (ASP), a declarative logic programming method, to find how to coordinate perception and simulation modules. We validate the effectiveness of our approach on two benchmarks, CLEVRER and CRAFT. Our enhancement achieves state-of-the-art performance on the CLEVRER challenge, significantly outperforming existing models. In the case of the CRAFT benchmark, we leverage a large pre-trained language model, such as GPT-3.5 and GPT-4, as a proxy for a dynamics simulator. Our findings show that this method can further improve its performance on counterfactual questions by providing alternative prompts instructed by symbolic causal reasoning.

Results

TaskDatasetMetricValueModel
Visual ReasoningCLEVRERAverage-per ques.95.24AI Core
Visual ReasoningCLEVRERCounterfactual-per opt.96.61AI Core
Visual ReasoningCLEVRERCounterfactual-per ques.90.72AI Core
Visual ReasoningCLEVRERDescriptive96.46AI Core
Visual ReasoningCLEVRERExplanatory-per opt.99.94AI Core
Visual ReasoningCLEVRERExplanatory-per ques.99.81AI Core
Visual ReasoningCLEVRERPredictive-per opt.93.96AI Core
Visual ReasoningCLEVRERPredictive-per ques.93.96AI Core

Related Papers

From Roots to Rewards: Dynamic Tree Reasoning with RL2025-07-17Enter the Mind Palace: Reasoning and Planning for Long-term Active Embodied Question Answering2025-07-17Vision-and-Language Training Helps Deploy Taxonomic Knowledge but Does Not Fundamentally Alter It2025-07-17City-VLM: Towards Multidomain Perception Scene Understanding via Multimodal Incomplete Learning2025-07-17Describe Anything Model for Visual Question Answering on Text-rich Images2025-07-16Is This Just Fantasy? Language Model Representations Reflect Human Judgments of Event Plausibility2025-07-16Warehouse Spatial Question Answering with LLM Agent2025-07-14Evaluating Attribute Confusion in Fashion Text-to-Image Generation2025-07-09