Geoff French, Samuli Laine, Timo Aila, Michal Mackiewicz, Graham Finlayson
Consistency regularization describes a class of approaches that have yielded ground breaking results in semi-supervised classification problems. Prior work has established the cluster assumption - under which the data distribution consists of uniform class clusters of samples separated by low density regions - as important to its success. We analyze the problem of semantic segmentation and find that its' distribution does not exhibit low density regions separating classes and offer this as an explanation for why semi-supervised segmentation is a challenging problem, with only a few reports of success. We then identify choice of augmentation as key to obtaining reliable performance without such low-density regions. We find that adapted variants of the recently proposed CutOut and CutMix augmentation techniques yield state-of-the-art semi-supervised semantic segmentation results in standard datasets. Furthermore, given its challenging nature we propose that semantic segmentation acts as an effective acid test for evaluating semi-supervised regularizers. Implementation at: https://github.com/Britefury/cutmix-semisup-seg.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | ScribbleKITTI | mIoU (1% Labels) | 36.7 | CutMix-Seg (Range View) |
| Semantic Segmentation | ScribbleKITTI | mIoU (10% Labels) | 50.7 | CutMix-Seg (Range View) |
| Semantic Segmentation | ScribbleKITTI | mIoU (20% Labels) | 52.9 | CutMix-Seg (Range View) |
| Semantic Segmentation | ScribbleKITTI | mIoU (50% Labels) | 54.3 | CutMix-Seg (Range View) |
| Semantic Segmentation | ADE20K 1/16 labeled | Validation mIoU | 29.8 | CutMix |
| Semantic Segmentation | ADE20K 1/32 labeled | Validation mIoU | 26.2 | CutMix |
| Semantic Segmentation | Cityscapes 100 samples labeled | Validation mIoU | 51.2 | CutMix (DeepLab v2, ImageNet pre-trained) |
| Semantic Segmentation | SemanticKITTI | mIoU (1% Labels) | 37.4 | CutMix-Seg (Range View) |
| Semantic Segmentation | SemanticKITTI | mIoU (10% Labels) | 54.3 | CutMix-Seg (Range View) |
| Semantic Segmentation | SemanticKITTI | mIoU (20% Labels) | 56.6 | CutMix-Seg (Range View) |
| Semantic Segmentation | SemanticKITTI | mIoU (50% Labels) | 57.6 | CutMix-Seg (Range View) |
| Semantic Segmentation | nuScenes | mIoU (1% Labels) | 43.8 | CutMix-Seg (Range View) |
| Semantic Segmentation | nuScenes | mIoU (10% Labels) | 63.9 | CutMix-Seg (Range View) |
| Semantic Segmentation | nuScenes | mIoU (20% Labels) | 64.8 | CutMix-Seg (Range View) |
| Semantic Segmentation | nuScenes | mIoU (50% Labels) | 69.8 | CutMix-Seg (Range View) |
| 10-shot image generation | ScribbleKITTI | mIoU (1% Labels) | 36.7 | CutMix-Seg (Range View) |
| 10-shot image generation | ScribbleKITTI | mIoU (10% Labels) | 50.7 | CutMix-Seg (Range View) |
| 10-shot image generation | ScribbleKITTI | mIoU (20% Labels) | 52.9 | CutMix-Seg (Range View) |
| 10-shot image generation | ScribbleKITTI | mIoU (50% Labels) | 54.3 | CutMix-Seg (Range View) |
| 10-shot image generation | ADE20K 1/16 labeled | Validation mIoU | 29.8 | CutMix |
| 10-shot image generation | ADE20K 1/32 labeled | Validation mIoU | 26.2 | CutMix |
| 10-shot image generation | Cityscapes 100 samples labeled | Validation mIoU | 51.2 | CutMix (DeepLab v2, ImageNet pre-trained) |
| 10-shot image generation | SemanticKITTI | mIoU (1% Labels) | 37.4 | CutMix-Seg (Range View) |
| 10-shot image generation | SemanticKITTI | mIoU (10% Labels) | 54.3 | CutMix-Seg (Range View) |
| 10-shot image generation | SemanticKITTI | mIoU (20% Labels) | 56.6 | CutMix-Seg (Range View) |
| 10-shot image generation | SemanticKITTI | mIoU (50% Labels) | 57.6 | CutMix-Seg (Range View) |
| 10-shot image generation | nuScenes | mIoU (1% Labels) | 43.8 | CutMix-Seg (Range View) |
| 10-shot image generation | nuScenes | mIoU (10% Labels) | 63.9 | CutMix-Seg (Range View) |
| 10-shot image generation | nuScenes | mIoU (20% Labels) | 64.8 | CutMix-Seg (Range View) |
| 10-shot image generation | nuScenes | mIoU (50% Labels) | 69.8 | CutMix-Seg (Range View) |