Jinyoung Jun, Jae-Han Lee, Chul Lee, Chang-Su Kim
We propose a novel algorithm for monocular depth estimation that decomposes a metric depth map into a normalized depth map and scale features. The proposed network is composed of a shared encoder and three decoders, called G-Net, N-Net, and M-Net, which estimate gradient maps, a normalized depth map, and a metric depth map, respectively. M-Net learns to estimate metric depths more accurately using relative depth features extracted by G-Net and N-Net. The proposed algorithm has the advantage that it can use datasets without metric depth labels to improve the performance of metric depth estimation. Experimental results on various datasets demonstrate that the proposed algorithm not only provides competitive performance to state-of-the-art algorithms but also yields acceptable results even when only a small amount of metric depth data is available for its training.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Depth Estimation | NYU-Depth V2 | Delta < 1.25 | 0.913 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^2 | 0.987 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^3 | 0.998 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | RMSE | 0.355 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | absolute relative error | 0.098 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | log 10 | 0.042 | Depth-Map-Decomposition-HRWSI |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25 | 0.907 | Depth-Map-Decomposition |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^2 | 0.986 | Depth-Map-Decomposition |
| Depth Estimation | NYU-Depth V2 | Delta < 1.25^3 | 0.997 | Depth-Map-Decomposition |
| Depth Estimation | NYU-Depth V2 | RMSE | 0.362 | Depth-Map-Decomposition |
| Depth Estimation | NYU-Depth V2 | absolute relative error | 0.1 | Depth-Map-Decomposition |
| Depth Estimation | NYU-Depth V2 | log 10 | 0.043 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | Delta < 1.25 | 0.913 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | Delta < 1.25^2 | 0.987 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | Delta < 1.25^3 | 0.998 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | RMSE | 0.355 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | absolute relative error | 0.098 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | log 10 | 0.042 | Depth-Map-Decomposition-HRWSI |
| 3D | NYU-Depth V2 | Delta < 1.25 | 0.907 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | Delta < 1.25^2 | 0.986 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | Delta < 1.25^3 | 0.997 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | RMSE | 0.362 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | absolute relative error | 0.1 | Depth-Map-Decomposition |
| 3D | NYU-Depth V2 | log 10 | 0.043 | Depth-Map-Decomposition |