TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Supervised Monocular Depth Hints

Self-Supervised Monocular Depth Hints

Jamie Watson, Michael Firman, Gabriel J. Brostow, Daniyar Turmukhambetov

2019-09-19ICCV 2019 10Self-Supervised LearningDepth PredictionDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Monocular depth estimators can be trained with various forms of self-supervision from binocular-stereo data to circumvent the need for high-quality laser scans or other ground-truth data. The disadvantage, however, is that the photometric reprojection losses used with self-supervised learning typically have multiple local minima. These plausible-looking alternatives to ground truth can restrict what a regression network learns, causing it to predict depth maps of limited quality. As one prominent example, depth discontinuities around thin structures are often incorrectly estimated by current state-of-the-art methods. Here, we study the problem of ambiguous reprojections in depth prediction from stereo-based self-supervision, and introduce Depth Hints to alleviate their effects. Depth Hints are complementary depth suggestions obtained from simple off-the-shelf stereo algorithms. These hints enhance an existing photometric loss function, and are used to guide a network to learn better weights. They require no additional data, and are assumed to be right only sometimes. We show that using our Depth Hints gives a substantial boost when training several leading self-supervised-from-stereo models, not just our own. Further, combined with other good practices, we produce state-of-the-art depth predictions on the KITTI benchmark.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen splitabsolute relative error0.096Depth Hints
Depth EstimationVA (Virtual Apartment)Absolute relative error (AbsRel)0.197Depth Hints
Depth EstimationVA (Virtual Apartment)Log root mean square error (RMSE_log)0.248Depth Hints
Depth EstimationVA (Virtual Apartment)Mean average error (MAE) 0.291Depth Hints
Depth EstimationVA (Virtual Apartment)Root mean square error (RMSE)0.427Depth Hints
3DKITTI Eigen splitabsolute relative error0.096Depth Hints
3DVA (Virtual Apartment)Absolute relative error (AbsRel)0.197Depth Hints
3DVA (Virtual Apartment)Log root mean square error (RMSE_log)0.248Depth Hints
3DVA (Virtual Apartment)Mean average error (MAE) 0.291Depth Hints
3DVA (Virtual Apartment)Root mean square error (RMSE)0.427Depth Hints

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14