TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/M-FUSE: Multi-frame Fusion for Scene Flow Estimation

M-FUSE: Multi-frame Fusion for Scene Flow Estimation

Lukas Mehl, Azin Jahedi, Jenny Schmalfuss, Andrés Bruhn

2022-07-12Scene Flow Estimation
PaperPDFCode(official)

Abstract

Recently, neural network for scene flow estimation show impressive results on automotive data such as the KITTI benchmark. However, despite of using sophisticated rigidity assumptions and parametrizations, such networks are typically limited to only two frame pairs which does not allow them to exploit temporal information. In our paper we address this shortcoming by proposing a novel multi-frame approach that considers an additional preceding stereo pair. To this end, we proceed in two steps: Firstly, building upon the recent RAFT-3D approach, we develop an improved two-frame baseline by incorporating an advanced stereo method. Secondly, and even more importantly, exploiting the specific modeling concepts of RAFT-3D, we propose a U-Net architecture that performs a fusion of forward and backward flow estimates and hence allows to integrate temporal information on demand. Experiments on the KITTI benchmark do not only show that the advantages of the improved baseline and the temporal fusion approach complement each other, they also demonstrate that the computed scene flow is highly accurate. More precisely, our approach ranks second overall and first for the even more challenging foreground objects, in total outperforming the original RAFT-3D method by more than 16%. Code is available at https://github.com/cv-stuttgart/M-FUSE.

Results

TaskDatasetMetricValueModel
Scene Flow EstimationSpring1px total34.896M-FUSE (F)
Scene Flow EstimationSpring1px total62.49M-FUSE (K)

Related Papers

MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second2025-07-14KDMOS:Knowledge Distillation for Motion Segmentation2025-06-17VoxelSplat: Dynamic Gaussian Splatting as an Effective Loss for Occupancy and Flow Prediction2025-06-05Estimating Scene Flow in Robot Surroundings with Distributed Miniaturized Time-of-Flight Sensors2025-04-03VoteFlow: Enforcing Local Rigidity in Self-Supervised Scene Flow2025-03-28Dynamic Point Maps: A Versatile Representation for Dynamic 3D Reconstruction2025-03-20SemanticFlow: A Self-Supervised Framework for Joint Scene Flow Prediction and Instance Segmentation in Dynamic Environments2025-03-19Toward Scalable, Flexible Scene Flow for Point Clouds2025-03-19