TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/LocalBins: Improving Depth Estimation by Learning Local Di...

LocalBins: Improving Depth Estimation by Learning Local Distributions

Shariq Farooq Bhat, Ibraheem Alhashim, Peter Wonka

2022-03-28Depth EstimationAllMonocular Depth Estimation
PaperPDFCode(official)

Abstract

We propose a novel architecture for depth estimation from a single image. The architecture itself is based on the popular encoder-decoder architecture that is frequently used as a starting point for all dense regression tasks. We build on AdaBins which estimates a global distribution of depth values for the input image and evolve the architecture in two ways. First, instead of predicting global depth distributions, we predict depth distributions of local neighborhoods at every pixel. Second, instead of predicting depth distributions only towards the end of the decoder, we involve all layers of the decoder. We call this new architecture LocalBins. Our results demonstrate a clear improvement over the state-of-the-art in all metrics on the NYU-Depth V2 dataset. Code and pretrained models will be made publicly available.

Results

TaskDatasetMetricValueModel
Depth EstimationNYU-Depth V2Delta < 1.250.91LocalBins
Depth EstimationNYU-Depth V2Delta < 1.25^20.986LocalBins
Depth EstimationNYU-Depth V2Delta < 1.25^30.997LocalBins
Depth EstimationNYU-Depth V2RMSE0.351LocalBins
Depth EstimationNYU-Depth V2absolute relative error0.098LocalBins
Depth EstimationNYU-Depth V2log 100.042LocalBins
3DNYU-Depth V2Delta < 1.250.91LocalBins
3DNYU-Depth V2Delta < 1.25^20.986LocalBins
3DNYU-Depth V2Delta < 1.25^30.997LocalBins
3DNYU-Depth V2RMSE0.351LocalBins
3DNYU-Depth V2absolute relative error0.098LocalBins
3DNYU-Depth V2log 100.042LocalBins

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15