Pose Constraints for Consistent Self-supervised Monocular Depth and Ego-motion

Zeeshan Khan Suri

2023-04-18

Motion Prediction, Unsupervised Monocular Depth Estimation, Egocentric Pose Estimation, Camera Pose Estimation, Depth Estimation, Monocular Depth Estimation

Abstract

Self-supervised monocular depth estimation approaches not only suffer from scale ambiguity but also infer depth maps whose scale is temporally inconsistent. While disambiguating scale during training is not possible without some form of ground-truth supervision, scale-consistent depth predictions would make it possible to calculate the scale once during inference as a post-processing step and reuse it over time. With this as the goal, a set of temporal consistency losses that minimize pose inconsistencies over time is introduced. Evaluations show that introducing these constraints not only reduces depth inconsistencies but also improves the baseline performance of depth and ego-motion prediction.
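The "calculate scale once, reuse it over time" idea from the abstract can be illustrated with a minimal sketch. It assumes median scaling against a sparse metric reference (for example LiDAR) on a single frame, which is a common post-processing choice in this field but is not confirmed here as the paper's own procedure. The function names `median_scale` and `apply_scale` and the toy data are hypothetical.

```python
from statistics import median


def median_scale(pred_depth, ref_depth):
    """Estimate one scalar scale as median(reference) / median(prediction).

    Both inputs are flattened depth maps of equal length. Pixels where the
    reference is missing (<= 0) or the prediction is non-positive are skipped.
    Assumption: median scaling is used here purely for illustration.
    """
    pairs = [(p, r) for p, r in zip(pred_depth, ref_depth) if p > 0 and r > 0]
    if not pairs:
        raise ValueError("no valid reference depth values")
    preds, refs = zip(*pairs)
    return median(refs) / median(preds)


def apply_scale(pred_depth, scale):
    """Convert an up-to-scale depth map to metric depth with a fixed scale."""
    return [scale * p for p in pred_depth]


# Toy data: predictions are consistently half of the metric depth.
frame0_pred = [1.0, 2.0, 3.0, 4.0]
frame0_ref = [2.0, 4.0, 0.0, 8.0]  # 0.0 marks a pixel without reference depth

# The scale is computed once, on the first frame only.
scale = median_scale(frame0_pred, frame0_ref)

# Later frames reuse the same scale; this is only valid if the network's
# predictions are scale-consistent over time.
frame1_pred = [1.5, 2.5, 5.0]
frame1_metric = apply_scale(frame1_pred, scale)
```

If the predicted scale drifts between frames, the scale estimated on the first frame no longer applies to later ones, which is why temporal scale consistency is a precondition for this shortcut.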

Results

Task	Dataset	Metric	Value	Model
Depth Estimation	KITTI Eigen split unsupervised	absolute relative error	0.113	pc4consistentdepth
Pose Estimation	KITTI Odometry	Absolute Trajectory Error [m]	0.014	pc4consistentdepth
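For readers unfamiliar with the depth metric in the table, the following sketch computes the absolute relative error under its usual definition, the mean of |predicted - ground truth| / ground truth over pixels with valid ground truth. The function name `abs_rel_error` and the toy values are hypothetical, and the exact evaluation protocol behind the reported 0.113 (depth caps, cropping, scaling) is not specified on this page.

```python
def abs_rel_error(pred_depth, gt_depth):
    """Mean of |pred - gt| / gt over pixels with valid ground truth (gt > 0)."""
    valid = [(p, g) for p, g in zip(pred_depth, gt_depth) if g > 0]
    if not valid:
        raise ValueError("no valid ground-truth depth values")
    return sum(abs(p - g) / g for p, g in valid) / len(valid)


# Toy flattened depth maps in metres; 0.0 marks a pixel without ground truth.
pred = [9.0, 22.0, 5.0]
gt = [10.0, 20.0, 0.0]

err = abs_rel_error(pred, gt)
print(f"absolute relative error: {err:.3f}")
```

Lower is better: a value of 0.113 means the predicted depth deviates from the ground truth by about 11.3% on average.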