Qi Fan, Wenjie Pei, Yu-Wing Tai, Chi-Keung Tang
Existing few-shot segmentation methods have achieved great progress based on the support-query matching framework. But they still heavily suffer from the limited coverage of intra-class variations from the few-shot supports provided. Motivated by the simple Gestalt principle that pixels belonging to the same object are more similar than those to different objects of same class, we propose a novel self-support matching strategy to alleviate this problem, which uses query prototypes to match query features, where the query prototypes are collected from high-confidence query predictions. This strategy can effectively capture the consistent underlying characteristics of the query objects, and thus fittingly match query features. We also propose an adaptive self-support background prototype generation module and self-support loss to further facilitate the self-support matching procedure. Our self-support network substantially improves the prototype quality, benefits more improvement from stronger backbones and more supports, and achieves SOTA on multiple datasets. Codes are at \url{https://github.com/fanq15/SSP}.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Few-Shot Learning | FSS-1000 (5-shot) | Mean IoU | 88.6 | SSP |
| Few-Shot Learning | COCO-20i (5-shot) | Mean IoU | 50.2 | SSP (ResNet-101) |
| Few-Shot Learning | COCO-20i (5-shot) | Mean IoU | 44.1 | SSP (ResNet-50) |
| Few-Shot Learning | FSS-1000 (1-shot) | Mean IoU | 87.3 | SSP |
| Few-Shot Learning | PASCAL-5i (1-Shot) | Mean IoU | 64.6 | SSP (ResNet-101) |
| Few-Shot Learning | PASCAL-5i (1-Shot) | Mean IoU | 61.4 | SSP (ResNet-50) |
| Few-Shot Learning | COCO-20i (1-shot) | Mean IoU | 42 | SSP (ResNet-101) |
| Few-Shot Learning | COCO-20i (1-shot) | Mean IoU | 37.4 | SSP (ResNet-50) |
| Few-Shot Learning | PASCAL-5i (5-Shot) | Mean IoU | 73.1 | SSP (ResNet-101) |
| Few-Shot Learning | PASCAL-5i (5-Shot) | Mean IoU | 69.3 | SSP (ResNet-50) |
| Few-Shot Semantic Segmentation | FSS-1000 (5-shot) | Mean IoU | 88.6 | SSP |
| Few-Shot Semantic Segmentation | COCO-20i (5-shot) | Mean IoU | 50.2 | SSP (ResNet-101) |
| Few-Shot Semantic Segmentation | COCO-20i (5-shot) | Mean IoU | 44.1 | SSP (ResNet-50) |
| Few-Shot Semantic Segmentation | FSS-1000 (1-shot) | Mean IoU | 87.3 | SSP |
| Few-Shot Semantic Segmentation | PASCAL-5i (1-Shot) | Mean IoU | 64.6 | SSP (ResNet-101) |
| Few-Shot Semantic Segmentation | PASCAL-5i (1-Shot) | Mean IoU | 61.4 | SSP (ResNet-50) |
| Few-Shot Semantic Segmentation | COCO-20i (1-shot) | Mean IoU | 42 | SSP (ResNet-101) |
| Few-Shot Semantic Segmentation | COCO-20i (1-shot) | Mean IoU | 37.4 | SSP (ResNet-50) |
| Few-Shot Semantic Segmentation | PASCAL-5i (5-Shot) | Mean IoU | 73.1 | SSP (ResNet-101) |
| Few-Shot Semantic Segmentation | PASCAL-5i (5-Shot) | Mean IoU | 69.3 | SSP (ResNet-50) |
| Meta-Learning | FSS-1000 (5-shot) | Mean IoU | 88.6 | SSP |
| Meta-Learning | COCO-20i (5-shot) | Mean IoU | 50.2 | SSP (ResNet-101) |
| Meta-Learning | COCO-20i (5-shot) | Mean IoU | 44.1 | SSP (ResNet-50) |
| Meta-Learning | FSS-1000 (1-shot) | Mean IoU | 87.3 | SSP |
| Meta-Learning | PASCAL-5i (1-Shot) | Mean IoU | 64.6 | SSP (ResNet-101) |
| Meta-Learning | PASCAL-5i (1-Shot) | Mean IoU | 61.4 | SSP (ResNet-50) |
| Meta-Learning | COCO-20i (1-shot) | Mean IoU | 42 | SSP (ResNet-101) |
| Meta-Learning | COCO-20i (1-shot) | Mean IoU | 37.4 | SSP (ResNet-50) |
| Meta-Learning | PASCAL-5i (5-Shot) | Mean IoU | 73.1 | SSP (ResNet-101) |
| Meta-Learning | PASCAL-5i (5-Shot) | Mean IoU | 69.3 | SSP (ResNet-50) |