Chen Liang, Wenguan Wang, Jiaxu Miao, Yi Yang
Recent advances in semi-supervised semantic segmentation have been heavily reliant on pseudo labeling to compensate for limited labeled data, disregarding the valuable relational knowledge among semantic concepts. To bridge this gap, we devise LogicDiag, a brand new neural-logic semi-supervised learning framework. Our key insight is that conflicts within pseudo labels, identified through symbolic knowledge, can serve as strong yet commonly ignored learning signals. LogicDiag resolves such conflicts via reasoning with logic-induced diagnoses, enabling the recovery of (potentially) erroneous pseudo labels, ultimately alleviating the notorious error accumulation problem. We showcase the practical application of LogicDiag in the data-hungry segmentation scenario, where we formalize the structured abstraction of semantic concepts as a set of logic rules. Extensive experiments on three standard semi-supervised semantic segmentation benchmarks demonstrate the effectiveness and generality of LogicDiag. Moreover, LogicDiag highlights the promising opportunities arising from the systematic integration of symbolic reasoning into the prevalent statistical, neural learning approaches.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Semantic Segmentation | COCO 1/512 labeled | Validation mIoU | 33.1 | LogicDiag |
| Semantic Segmentation | COCO 1/256 labeled | Validation mIoU | 40.3 | LogicDiag |
| Semantic Segmentation | PASCAL VOC 2012 92 labeled | Validation mIoU | 73.3 | LogicDiag (DeepLab v3+ with ResNet-101) |
| Semantic Segmentation | PASCAL VOC 2012 732 labeled | Validation mIoU | 79.4 | LogicDiag (DeepLab v3+ with ResNet-101) |
| Semantic Segmentation | COCO 1/128 labeled | Validation mIoU | 45.4 | LogicDiag |
| Semantic Segmentation | COCO 1/64 labeled | Validation mIoU | 48.8 | LogicDiag |
| Semantic Segmentation | PASCAL VOC 2012 366 labeled | Validation mIoU | 77.9 | LogicDiag (DeepLab v3+ with ResNet-101) |
| Semantic Segmentation | COCO 1/32 labeled | Validation mIoU | 50.5 | LogicDiag |
| Semantic Segmentation | PASCAL VOC 2012 183 labeled | Validation mIoU | 76.7 | LogicDiag (DeepLab v3+ with ResNet-101) |
| 10-shot image generation | COCO 1/512 labeled | Validation mIoU | 33.1 | LogicDiag |
| 10-shot image generation | COCO 1/256 labeled | Validation mIoU | 40.3 | LogicDiag |
| 10-shot image generation | PASCAL VOC 2012 92 labeled | Validation mIoU | 73.3 | LogicDiag (DeepLab v3+ with ResNet-101) |
| 10-shot image generation | PASCAL VOC 2012 732 labeled | Validation mIoU | 79.4 | LogicDiag (DeepLab v3+ with ResNet-101) |
| 10-shot image generation | COCO 1/128 labeled | Validation mIoU | 45.4 | LogicDiag |
| 10-shot image generation | COCO 1/64 labeled | Validation mIoU | 48.8 | LogicDiag |
| 10-shot image generation | PASCAL VOC 2012 366 labeled | Validation mIoU | 77.9 | LogicDiag (DeepLab v3+ with ResNet-101) |
| 10-shot image generation | COCO 1/32 labeled | Validation mIoU | 50.5 | LogicDiag |
| 10-shot image generation | PASCAL VOC 2012 183 labeled | Validation mIoU | 76.7 | LogicDiag (DeepLab v3+ with ResNet-101) |