TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/IEBins: Iterative Elastic Bins for Monocular Depth Estimat...

IEBins: Iterative Elastic Bins for Monocular Depth Estimation

Shuwei Shao, Zhongcai Pei, Xingming Wu, Zhong Liu, Weihai Chen, Zhengguo Li

2023-09-25NeurIPS 2023 11regressionDepth EstimationMonocular Depth Estimation
PaperPDFCode(official)

Abstract

Monocular depth estimation (MDE) is a fundamental topic of geometric computer vision and a core technique for many downstream applications. Recently, several methods reframe the MDE as a classification-regression problem where a linear combination of probabilistic distribution and bin centers is used to predict depth. In this paper, we propose a novel concept of iterative elastic bins (IEBins) for the classification-regression-based MDE. The proposed IEBins aims to search for high-quality depth by progressively optimizing the search range, which involves multiple stages and each stage performs a finer-grained depth search in the target bin on top of its previous stage. To alleviate the possible error accumulation during the iterative process, we utilize a novel elastic target bin to replace the original target bin, the width of which is adjusted elastically based on the depth uncertainty. Furthermore, we develop a dedicated framework composed of a feature extractor and an iterative optimizer that has powerful temporal context modeling capabilities benefiting from the GRU-based architecture. Extensive experiments on the KITTI, NYU-Depth-v2 and SUN RGB-D datasets demonstrate that the proposed method surpasses prior state-of-the-art competitors. The source code is publicly available at https://github.com/ShuweiShao/IEBins.

Results

TaskDatasetMetricValueModel
Depth EstimationNYU-Depth V2Delta < 1.250.936IEBins
Depth EstimationNYU-Depth V2Delta < 1.25^20.992IEBins
Depth EstimationNYU-Depth V2Delta < 1.25^30.998IEBins
Depth EstimationNYU-Depth V2RMSE0.314IEBins
Depth EstimationNYU-Depth V2absolute relative error0.087IEBins
Depth EstimationNYU-Depth V2log 100.038IEBins
Depth EstimationKITTI Eigen splitDelta < 1.250.978IEBins
Depth EstimationKITTI Eigen splitDelta < 1.25^20.998IEBins
Depth EstimationKITTI Eigen splitDelta < 1.25^30.999IEBins
Depth EstimationKITTI Eigen splitRMSE2.011IEBins
Depth EstimationKITTI Eigen splitRMSE log0.075IEBins
Depth EstimationKITTI Eigen splitSq Rel0.142IEBins
Depth EstimationKITTI Eigen splitabsolute relative error0.05IEBins
3DNYU-Depth V2Delta < 1.250.936IEBins
3DNYU-Depth V2Delta < 1.25^20.992IEBins
3DNYU-Depth V2Delta < 1.25^30.998IEBins
3DNYU-Depth V2RMSE0.314IEBins
3DNYU-Depth V2absolute relative error0.087IEBins
3DNYU-Depth V2log 100.038IEBins
3DKITTI Eigen splitDelta < 1.250.978IEBins
3DKITTI Eigen splitDelta < 1.25^20.998IEBins
3DKITTI Eigen splitDelta < 1.25^30.999IEBins
3DKITTI Eigen splitRMSE2.011IEBins
3DKITTI Eigen splitRMSE log0.075IEBins
3DKITTI Eigen splitSq Rel0.142IEBins
3DKITTI Eigen splitabsolute relative error0.05IEBins

Related Papers

Language Integration in Fine-Tuning Multimodal Large Language Models for Image-Based Regression2025-07-20$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Neural Network-Guided Symbolic Regression for Interpretable Descriptor Discovery in Perovskite Catalysts2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16Second-Order Bounds for [0,1]-Valued Regression via Betting Loss2025-07-16Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16