Tuan-Hung Vu, Himalaya Jain, Maxime Bucher, Matthieu Cord, Patrick Pérez
Semantic segmentation is a key problem for many computer vision tasks. While approaches based on convolutional neural networks constantly break new records on different benchmarks, generalizing well to diverse testing environments remains a major challenge. In numerous real world applications, there is indeed a large gap between data distributions in train and test domains, which results in severe performance loss at run-time. In this work, we address the task of unsupervised domain adaptation in semantic segmentation with losses based on the entropy of the pixel-wise predictions. To this end, we propose two novel, complementary methods using (i) entropy loss and (ii) adversarial loss respectively. We demonstrate state-of-the-art performance in semantic segmentation on two challenging "synthetic-2-real" set-ups and show that the approach can also be used for detection.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Image-to-Image Translation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 48 | ADVENT |
| Image-to-Image Translation | GTAV-to-Cityscapes Labels | mIoU | 44.8 | ADVENT |
| Image-to-Image Translation | GTAV-to-Cityscapes Labels | mIoU | 45.5 | AdvEnt(with MinEnt) |
| Domain Adaptation | SYNTHIA-to-Cityscapes | mIoU | 41.2 | ADVENT (ResNet-101) |
| Domain Adaptation | Panoptic SYNTHIA-to-Mapillary | mPQ | 18.3 | ADVENT |
| Domain Adaptation | Panoptic SYNTHIA-to-Cityscapes | mPQ | 28.1 | ADVENT |
| Image Generation | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 48 | ADVENT |
| Image Generation | GTAV-to-Cityscapes Labels | mIoU | 44.8 | ADVENT |
| Image Generation | GTAV-to-Cityscapes Labels | mIoU | 45.5 | AdvEnt(with MinEnt) |
| 1 Image, 2*2 Stitching | SYNTHIA-to-Cityscapes | mIoU (13 classes) | 48 | ADVENT |
| 1 Image, 2*2 Stitching | GTAV-to-Cityscapes Labels | mIoU | 44.8 | ADVENT |
| 1 Image, 2*2 Stitching | GTAV-to-Cityscapes Labels | mIoU | 45.5 | AdvEnt(with MinEnt) |