TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/HR-Depth: High Resolution Self-Supervised Monocular Depth ...

HR-Depth: High Resolution Self-Supervised Monocular Depth Estimation

Xiaoyang Lyu, Liang Liu, Mengmeng Wang, Xin Kong, Lina Liu, Yong liu, Xinxin Chen, Yi Yuan

2020-12-14Vocal Bursts Intensity PredictionSelf-Supervised LearningUnsupervised Monocular Depth EstimationDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Self-supervised learning shows great potential in monoculardepth estimation, using image sequences as the only source ofsupervision. Although people try to use the high-resolutionimage for depth estimation, the accuracy of prediction hasnot been significantly improved. In this work, we find thecore reason comes from the inaccurate depth estimation inlarge gradient regions, making the bilinear interpolation er-ror gradually disappear as the resolution increases. To obtainmore accurate depth estimation in large gradient regions, itis necessary to obtain high-resolution features with spatialand semantic information. Therefore, we present an improvedDepthNet, HR-Depth, with two effective strategies: (1) re-design the skip-connection in DepthNet to get better high-resolution features and (2) propose feature fusion Squeeze-and-Excitation(fSE) module to fuse feature more efficiently.Using Resnet-18 as the encoder, HR-Depth surpasses all pre-vious state-of-the-art(SoTA) methods with the least param-eters at both high and low resolution. Moreover, previousstate-of-the-art methods are based on fairly complex and deepnetworks with a mass of parameters which limits their realapplications. Thus we also construct a lightweight networkwhich uses MobileNetV3 as encoder. Experiments show thatthe lightweight network can perform on par with many largemodels like Monodepth2 at high-resolution with only20%parameters. All codes and models will be available at https://github.com/shawLyu/HR-Depth.

Results

TaskDatasetMetricValueModel
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.101HR-Depth-MS-1024X320
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.104HR-Depth-M-1280x384
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.104Lite-HR-Depth-T-1280x384
Depth EstimationKITTI Eigen split unsupervisedabsolute relative error0.109HR-Depth-M-640x192
3DKITTI Eigen split unsupervisedabsolute relative error0.101HR-Depth-MS-1024X320
3DKITTI Eigen split unsupervisedabsolute relative error0.104HR-Depth-M-1280x384
3DKITTI Eigen split unsupervisedabsolute relative error0.104Lite-HR-Depth-T-1280x384
3DKITTI Eigen split unsupervisedabsolute relative error0.109HR-Depth-M-640x192

Related Papers

A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Self-supervised Learning on Camera Trap Footage Yields a Strong Universal Face Embedder2025-07-14