Jinsun Park, Kyungdon Joo, Zhe Hu, Chi-Kuei Liu, In So Kweon
In this paper, we propose a robust and efficient end-to-end non-local spatial propagation network for depth completion. The proposed network takes RGB and sparse depth images as inputs and estimates non-local neighbors and their affinities of each pixel, as well as an initial depth map with pixel-wise confidences. The initial depth prediction is then iteratively refined by its confidence and non-local spatial propagation procedure based on the predicted non-local neighbors and corresponding affinities. Unlike previous algorithms that utilize fixed-local neighbors, the proposed algorithm effectively avoids irrelevant local neighbors and concentrates on relevant non-local neighbors during propagation. In addition, we introduce a learnable affinity normalization to better learn the affinity combinations compared to conventional methods. The proposed algorithm is inherently robust to the mixed-depth problem on depth boundaries, which is one of the major issues for existing depth estimation/completion algorithms. Experimental results on indoor and outdoor datasets demonstrate that the proposed algorithm is superior to conventional algorithms in terms of depth completion accuracy and robustness to the mixed-depth problem. Our implementation is publicly available on the project page.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Depth Estimation | KITTI Depth Completion Validation | RMSE | 771.8 | NLSPN |
| 3D | KITTI Depth Completion Validation | RMSE | 771.8 | NLSPN |
| Depth Completion | KITTI Depth Completion | MAE | 199.59 | NLSPN |
| Depth Completion | KITTI Depth Completion | RMSE | 741.68 | NLSPN |
| Depth Completion | KITTI Depth Completion | Runtime [ms] | 220 | NLSPN |
| Depth Completion | KITTI Depth Completion | iMAE | 0.84 | NLSPN |
| Depth Completion | KITTI Depth Completion | iRMSE | 1.99 | NLSPN |
| Depth Completion | VOID | MAE | 26.736 | NLSPN |
| Depth Completion | VOID | RMSE | 79.121 | NLSPN |
| Depth Completion | VOID | iMAE | 12.703 | NLSPN |
| Depth Completion | VOID | iRMSE | 33.876 | NLSPN |
| Depth Completion | NYU-Depth V2 | REL | 0.012 | NLSPN |
| Depth Completion | NYU-Depth V2 | RMSE | 0.092 | NLSPN |