TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Pose Constraints for Consistent Self-supervised Monocular ...

Pose Constraints for Consistent Self-supervised Monocular Depth and Ego-motion

Zeeshan Khan Suri

2023-04-18motion predictionUnsupervised Monocular Depth EstimationEgocentric Pose EstimationCamera Pose EstimationDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Self-supervised monocular depth estimation approaches suffer not only from scale ambiguity but also infer temporally inconsistent depth maps w.r.t. scale. While disambiguating scale during training is not possible without some kind of ground truth supervision, having scale consistent depth predictions would make it possible to calculate scale once during inference as a post-processing step and use it over-time. With this as a goal, a set of temporal consistency losses that minimize pose inconsistencies over time are introduced. Evaluations show that introducing these constraints not only reduces depth inconsistencies but also improves the baseline performance of depth and ego-motion prediction.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.113pc4consistentdepth
3D Human Pose EstimationKitti OdometryAbsolute Trajectory Error [m]0.014pc4consistentdepth
Pose EstimationKitti OdometryAbsolute Trajectory Error [m]0.014pc4consistentdepth
3DKitti OdometryAbsolute Trajectory Error [m]0.014pc4consistentdepth
3DKITTI Eigen split unsupervisedabsolute relative error0.113pc4consistentdepth
1 Image, 2*2 StitchiKitti OdometryAbsolute Trajectory Error [m]0.014pc4consistentdepth

Related Papers

$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17SpatialTrackerV2: 3D Point Tracking Made Easy2025-07-16SGLoc: Semantic Localization System for Camera Pose Estimation from 3D Gaussian Splatting Representation2025-07-16BRUM: Robust 3D Vehicle Reconstruction from 360 Sparse Images2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15