Koutilya PNVR, Hao Zhou, David Jacobs
We propose a novel method for combining synthetic and real images when training networks to determine geometric information from a single image. The method maps both image types into a single, shared domain, and this mapping is connected to a primary network for end-to-end training. Ideally, images from the two domains then present shared information to the primary network. Our experiments demonstrate significant improvements over the state of the art in two important domains: surface normal estimation of human faces and monocular depth estimation for outdoor scenes, both in an unsupervised setting.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Depth Estimation | Make3D | Abs Rel | 0.377 | SharinGAN |
| Depth Estimation | Make3D | RMSE | 8.388 | SharinGAN |
| Depth Estimation | Make3D | Sq Rel | 4.9 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | Delta < 1.25 | 0.864 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | Delta < 1.25^2 | 0.954 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | Delta < 1.25^3 | 0.981 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | RMSE | 3.77 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | RMSE log | 0.19 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | Sq Rel | 0.673 | SharinGAN |
| Depth Estimation | KITTI Eigen split unsupervised | Abs Rel | 0.109 | SharinGAN |
| 3D | Make3D | Abs Rel | 0.377 | SharinGAN |
| 3D | Make3D | RMSE | 8.388 | SharinGAN |
| 3D | Make3D | Sq Rel | 4.9 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | Delta < 1.25 | 0.864 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | Delta < 1.25^2 | 0.954 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | Delta < 1.25^3 | 0.981 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | RMSE | 3.77 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | RMSE log | 0.19 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | Sq Rel | 0.673 | SharinGAN |
| 3D | KITTI Eigen split unsupervised | Abs Rel | 0.109 | SharinGAN |
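
The metrics in the table are not defined on this page. As a rough guide, the sketch below computes them using the definitions commonly used in monocular depth evaluation; it is an assumption that the reported numbers follow exactly these formulas. The function name `depth_metrics` is hypothetical, and benchmark-specific details such as depth caps, crops, or scale alignment are deliberately left out.

```python
import numpy as np

def depth_metrics(pred, gt):
    """Commonly used depth-estimation metrics (assumed definitions).

    pred, gt: arrays of predicted and ground-truth depths, same shape.
    Pixels with gt <= 0 are treated as missing and ignored.
    Predictions are assumed to be strictly positive.
    """
    pred = np.asarray(pred, dtype=np.float64)
    gt = np.asarray(gt, dtype=np.float64)

    # Keep only pixels that have a valid ground-truth depth.
    valid = gt > 0
    pred, gt = pred[valid], gt[valid]

    # Symmetric ratio used by the threshold accuracies (Delta < 1.25^k).
    ratio = np.maximum(pred / gt, gt / pred)
    diff = pred - gt

    return {
        "abs_rel": float(np.mean(np.abs(diff) / gt)),
        "sq_rel": float(np.mean(diff ** 2 / gt)),
        "rmse": float(np.sqrt(np.mean(diff ** 2))),
        "rmse_log": float(np.sqrt(np.mean((np.log(pred) - np.log(gt)) ** 2))),
        "delta_1": float(np.mean(ratio < 1.25)),
        "delta_2": float(np.mean(ratio < 1.25 ** 2)),
        "delta_3": float(np.mean(ratio < 1.25 ** 3)),
    }

if __name__ == "__main__":
    # Toy example: the third pixel is underestimated by a factor of two.
    metrics = depth_metrics(pred=[1.0, 2.0, 2.0], gt=[1.0, 2.0, 4.0])
    for name, value in metrics.items():
        print(f"{name}: {value:.4f}")
```

Lower is better for Abs Rel, Sq Rel, RMSE, and RMSE log; higher is better for the Delta threshold accuracies, which measure the fraction of pixels whose predicted depth is within the given factor of the ground truth.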