TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Global-Local Path Networks for Monocular Depth Estimation ...

Global-Local Path Networks for Monocular Depth Estimation with Vertical CutDepth

Doyeon Kim, Woonghyun Ka, Pyungwhan Ahn, Donggyu Joo, Sehwan Chun, Junmo Kim

2022-01-19Depth EstimationMonocular Depth Estimation
PaperPDFCodeCodeCodeCode(official)

Abstract

Depth estimation from a single image is an important task that can be applied to various fields in computer vision, and has grown rapidly with the development of convolutional neural networks. In this paper, we propose a novel structure and training strategy for monocular depth estimation to further improve the prediction accuracy of the network. We deploy a hierarchical transformer encoder to capture and convey the global context, and design a lightweight yet powerful decoder to generate an estimated depth map while considering local connectivity. By constructing connected paths between multi-scale local features and the global decoding stream with our proposed selective feature fusion module, the network can integrate both representations and recover fine details. In addition, the proposed decoder shows better performance than the previously proposed decoders, with considerably less computational complexity. Furthermore, we improve the depth-specific augmentation method by utilizing an important observation in depth estimation to enhance the model. Our network achieves state-of-the-art performance over the challenging depth dataset NYU Depth V2. Extensive experiments have been conducted to validate and show the effectiveness of the proposed approach. Finally, our model shows better generalisation ability and robustness than other comparative models.

Results

TaskDatasetMetricValueModel
Depth EstimationNYU-Depth V2Delta < 1.250.915GLPDepth
Depth EstimationNYU-Depth V2Delta < 1.25^20.988GLPDepth
Depth EstimationNYU-Depth V2Delta < 1.25^30.997GLPDepth
Depth EstimationNYU-Depth V2RMSE0.344GLPDepth
Depth EstimationNYU-Depth V2absolute relative error0.098GLPDepth
Depth EstimationNYU-Depth V2log 100.042GLPDepth
Depth EstimationKITTI Eigen splitDelta < 1.250.967GLPDepth
Depth EstimationKITTI Eigen splitDelta < 1.25^20.996GLPDepth
Depth EstimationKITTI Eigen splitDelta < 1.25^30.999GLPDepth
Depth EstimationKITTI Eigen splitRMSE2.297GLPDepth
Depth EstimationKITTI Eigen splitRMSE log0.086GLPDepth
Depth EstimationKITTI Eigen splitabsolute relative error0.057GLPDepth
3DNYU-Depth V2Delta < 1.250.915GLPDepth
3DNYU-Depth V2Delta < 1.25^20.988GLPDepth
3DNYU-Depth V2Delta < 1.25^30.997GLPDepth
3DNYU-Depth V2RMSE0.344GLPDepth
3DNYU-Depth V2absolute relative error0.098GLPDepth
3DNYU-Depth V2log 100.042GLPDepth
3DKITTI Eigen splitDelta < 1.250.967GLPDepth
3DKITTI Eigen splitDelta < 1.25^20.996GLPDepth
3DKITTI Eigen splitDelta < 1.25^30.999GLPDepth
3DKITTI Eigen splitRMSE2.297GLPDepth
3DKITTI Eigen splitRMSE log0.086GLPDepth
3DKITTI Eigen splitabsolute relative error0.057GLPDepth

Related Papers

$S^2M^2$: Scalable Stereo Matching Model for Reliable Depth Estimation2025-07-17$π^3$: Scalable Permutation-Equivariant Visual Geometry Learning2025-07-17Efficient Calisthenics Skills Classification through Foreground Instance Selection and Depth Estimation2025-07-16Vision-based Perception for Autonomous Vehicles in Obstacle Avoidance Scenarios2025-07-16MonoMVSNet: Monocular Priors Guided Multi-View Stereo Network2025-07-15Towards Depth Foundation Model: Recent Trends in Vision-Based Depth Estimation2025-07-15Cameras as Relative Positional Encoding2025-07-14ByDeWay: Boost Your multimodal LLM with DEpth prompting in a Training-Free Way2025-07-11