TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/The Drunkard's Odometry: Estimating Camera Motion in Defor...

The Drunkard's Odometry: Estimating Camera Motion in Deforming Scenes

David Recasens, Martin R. Oswald, Marc Pollefeys, Javier Civera

2023-06-29Visual NavigationOptical Flow Estimation6D Pose Estimation using RGBD
PaperPDFCode(official)

Abstract

Estimating camera motion in deformable scenes poses a complex and open research challenge. Most existing non-rigid structure from motion techniques assume to observe also static scene parts besides deforming scene parts in order to establish an anchoring reference. However, this assumption does not hold true in certain relevant application cases such as endoscopies. Deformable odometry and SLAM pipelines, which tackle the most challenging scenario of exploratory trajectories, suffer from a lack of robustness and proper quantitative evaluation methodologies. To tackle this issue with a common benchmark, we introduce the Drunkard's Dataset, a challenging collection of synthetic data targeting visual navigation and reconstruction in deformable environments. This dataset is the first large set of exploratory camera trajectories with ground truth inside 3D scenes where every surface exhibits non-rigid deformations over time. Simulations in realistic 3D buildings lets us obtain a vast amount of data and ground truth labels, including camera poses, RGB images and depth, optical flow and normal maps at high resolution and quality. We further present a novel deformable odometry method, dubbed the Drunkard's Odometry, which decomposes optical flow estimates into rigid-body camera motion and non-rigid scene deformations. In order to validate our data, our work contains an evaluation of several baselines as well as a novel tracking error metric which does not require ground truth data. Dataset and code: https://davidrecasens.github.io/TheDrunkard'sOdometry/

Results

TaskDatasetMetricValueModel
Pose EstimationDrunkard's Dataset Level3 Scene0Absolute Trajectory Error [m]1.74Drunkard's Odometry
Pose EstimationDrunkard's Dataset Level3 Scene0Relative Position Error Rotation [º]0.48Drunkard's Odometry
Pose EstimationDrunkard's Dataset Level3 Scene0Relative Position Error Translation [cm]1.82Drunkard's Odometry
3DDrunkard's Dataset Level3 Scene0Absolute Trajectory Error [m]1.74Drunkard's Odometry
3DDrunkard's Dataset Level3 Scene0Relative Position Error Rotation [º]0.48Drunkard's Odometry
3DDrunkard's Dataset Level3 Scene0Relative Position Error Translation [cm]1.82Drunkard's Odometry
1 Image, 2*2 StitchiDrunkard's Dataset Level3 Scene0Absolute Trajectory Error [m]1.74Drunkard's Odometry
1 Image, 2*2 StitchiDrunkard's Dataset Level3 Scene0Relative Position Error Rotation [º]0.48Drunkard's Odometry
1 Image, 2*2 StitchiDrunkard's Dataset Level3 Scene0Relative Position Error Translation [cm]1.82Drunkard's Odometry

Related Papers

Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17An Efficient Approach for Muscle Segmentation and 3D Reconstruction Using Keypoint Tracking in MRI Scan2025-07-11Learning to Track Any Points from Human Motion2025-07-08TLB-VFI: Temporal-Aware Latent Brownian Bridge Diffusion for Video Frame Interpolation2025-07-07MEMFOF: High-Resolution Training for Memory-Efficient Multi-Frame Optical Flow Estimation2025-06-29EndoFlow-SLAM: Real-Time Endoscopic SLAM with Flow-Constrained Gaussian Splatting2025-06-26WAFT: Warping-Alone Field Transforms for Optical Flow2025-06-26Feature Hallucination for Self-supervised Action Recognition2025-06-25