Shuwei Shao, Zhongcai Pei, Weihai Chen, Xingming Wu, Zhengguo Li
Monocular depth estimation has drawn widespread attention from the vision community due to its broad applications. In this paper, we propose a novel physics (geometry)-driven deep learning framework for monocular depth estimation by assuming that 3D scenes are constituted by piece-wise planes. Particularly, we introduce a new normal-distance head that outputs pixel-level surface normal and plane-to-origin distance for deriving depth at each position. Meanwhile, the normal and distance are regularized by a developed plane-aware consistency constraint. We further integrate an additional depth head to improve the robustness of the proposed framework. To fully exploit the strengths of these two heads, we develop an effective contrastive iterative refinement module that refines depth in a complementary manner according to the depth uncertainty. Extensive experiments indicate that the proposed method exceeds previous state-of-the-art competitors on the NYU-Depth-v2, KITTI and SUN RGB-D datasets. Notably, it ranks 1st among all submissions on the KITTI depth prediction online benchmark at the submission time.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Depth Estimation | NYU-Depth V2 | Delta < 1.25 | 0.936 | NDDepth |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^2 | 0.991 | NDDepth |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^3 | 0.998 | NDDepth |
| Depth Estimation | NYU-Depth V2 | RMSE | 0.311 | NDDepth |
| Depth Estimation | NYU-Depth V2 | absolute relative error | 0.087 | NDDepth |
| Depth Estimation | NYU-Depth V2 | log 10 | 0.038 | NDDepth |
| Depth Estimation | KITTI Eigen split | Delta < 1.25 | 0.978 | NDDepth |
| Depth Estimation | KITTI Eigen split | Delta < 1.25^2 | 0.998 | NDDepth |
| Depth Estimation | KITTI Eigen split | Delta < 1.25^3 | 0.999 | NDDepth |
| Depth Estimation | KITTI Eigen split | RMSE | 2.025 | NDDepth |
| Depth Estimation | KITTI Eigen split | RMSE log | 0.075 | NDDepth |
| Depth Estimation | KITTI Eigen split | Sq Rel | 0.141 | NDDepth |
| Depth Estimation | KITTI Eigen split | absolute relative error | 0.05 | NDDepth |
| 3D | NYU-Depth V2 | Delta < 1.25 | 0.936 | NDDepth |
| 3D | NYU-Depth V2 | Delta < 1.25^2 | 0.991 | NDDepth |
| 3D | NYU-Depth V2 | Delta < 1.25^3 | 0.998 | NDDepth |
| 3D | NYU-Depth V2 | RMSE | 0.311 | NDDepth |
| 3D | NYU-Depth V2 | absolute relative error | 0.087 | NDDepth |
| 3D | NYU-Depth V2 | log 10 | 0.038 | NDDepth |
| 3D | KITTI Eigen split | Delta < 1.25 | 0.978 | NDDepth |
| 3D | KITTI Eigen split | Delta < 1.25^2 | 0.998 | NDDepth |
| 3D | KITTI Eigen split | Delta < 1.25^3 | 0.999 | NDDepth |
| 3D | KITTI Eigen split | RMSE | 2.025 | NDDepth |
| 3D | KITTI Eigen split | RMSE log | 0.075 | NDDepth |
| 3D | KITTI Eigen split | Sq Rel | 0.141 | NDDepth |
| 3D | KITTI Eigen split | absolute relative error | 0.05 | NDDepth |