TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Forward Prediction for Physical Reasoning

Forward Prediction for Physical Reasoning

Rohit Girdhar, Laura Gustafson, Aaron Adcock, Laurens van der Maaten

2020-06-18PredictionVisual Reasoning
PaperPDFCode(official)

Abstract

Physical reasoning requires forward prediction: the ability to forecast what will happen next given some initial world state. We study the performance of state-of-the-art forward-prediction models in the complex physical-reasoning tasks of the PHYRE benchmark. We do so by incorporating models that operate on object or pixel-based representations of the world into simple physical-reasoning agents. We find that forward-prediction models can improve physical-reasoning performance, particularly on complex tasks that involve many objects. However, we also find that these improvements are contingent on the test tasks being small variations of train tasks, and that generalization to completely new task templates is challenging. Surprisingly, we observe that forward predictors with better pixel accuracy do not necessarily lead to better physical-reasoning performance.Nevertheless, our best models set a new state-of-the-art on the PHYRE benchmark.

Results

TaskDatasetMetricValueModel
Visual ReasoningPHYRE-1B-WithinAUCCESS80Dec[Joint]1f
Visual ReasoningPHYRE-1B-CrossAUCCESS40.3Dec[Joint]1f

Related Papers

Multi-Strategy Improved Snake Optimizer Accelerated CNN-LSTM-Attention-Adaboost for Trajectory Prediction2025-07-21LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17Generative Click-through Rate Prediction with Applications to Search Advertising2025-07-15Beyond Task-Specific Reasoning: A Unified Conditional Generative Framework for Abstract Visual Reasoning2025-07-15Conformation-Aware Structure Prediction of Antigen-Recognizing Immune Proteins2025-07-11PyVision: Agentic Vision with Dynamic Tooling2025-07-10Foundation models for time series forecasting: Application in conformal prediction2025-07-09Orchestrator-Agent Trust: A Modular Agentic AI Visual Classification System with Trust-Aware Orchestration and RAG-Based Reasoning2025-07-09