Dense Gaussian Processes for Few-Shot Segmentation

Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan

2021-10-07

Gaussian Processes, Segmentation, Few-Shot Semantic Segmentation

Abstract

Few-shot segmentation is a challenging dense prediction task that entails segmenting a novel query image given only a small annotated support set. The key problem is thus to design a method that aggregates detailed information from the support set while being robust to large variations in appearance and context. To this end, we propose a few-shot segmentation method based on dense Gaussian process (GP) regression. Given the support set, our dense GP learns the mapping from local deep image features to mask values and is capable of capturing complex appearance distributions. Furthermore, it provides a principled means of capturing uncertainty, which serves as another powerful cue for the final segmentation, obtained by a CNN decoder. Instead of a one-dimensional mask output, we further exploit the end-to-end learning capabilities of our approach to learn a high-dimensional output space for the GP. Our approach sets a new state of the art on the PASCAL-5i and COCO-20i benchmarks, achieving an absolute gain of +8.4 mIoU in the COCO-20i 5-shot setting. Furthermore, the segmentation quality of our approach scales gracefully with increasing support set size, and the approach achieves robust cross-dataset transfer. Code and trained models are available at https://github.com/joakimjohnander/dgpnet.
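To make the core idea concrete, the following is a minimal sketch of GP regression from support features to mask values, returning both a posterior mean and a per-location variance for the query. It is not the authors' implementation: the squared-exponential kernel, the `length_scale` and `noise_var` values, and the function names `se_kernel` and `gp_posterior` are assumptions chosen for illustration, and the features here are random vectors rather than deep image features.

```python
import numpy as np


def se_kernel(a, b, length_scale=1.0):
    """Squared-exponential kernel between rows of a (N, D) and b (M, D)."""
    sq_dists = (
        np.sum(a**2, axis=1)[:, None]
        + np.sum(b**2, axis=1)[None, :]
        - 2.0 * a @ b.T
    )
    sq_dists = np.maximum(sq_dists, 0.0)  # guard against small negative values
    return np.exp(-0.5 * sq_dists / length_scale**2)


def gp_posterior(support_feats, support_masks, query_feats,
                 noise_var=0.1, length_scale=1.0):
    """Posterior mean and variance of mask values at the query locations.

    support_feats: (N, D) one feature vector per support location
    support_masks: (N,) or (N, E) mask values (E > 1 for a learned output space)
    query_feats:   (M, D) one feature vector per query location
    """
    k_ss = se_kernel(support_feats, support_feats, length_scale)
    k_qs = se_kernel(query_feats, support_feats, length_scale)

    # Cholesky factor of the noisy support covariance (assumed hyperparameters).
    chol = np.linalg.cholesky(k_ss + noise_var * np.eye(len(support_feats)))

    # alpha = (K_ss + noise_var * I)^{-1} y, via two solves with the factor.
    alpha = np.linalg.solve(chol.T, np.linalg.solve(chol, support_masks))
    mean = k_qs @ alpha

    # Posterior variance; the prior variance k(x, x) is 1 for this kernel.
    v = np.linalg.solve(chol, k_qs.T)
    var = 1.0 - np.sum(v**2, axis=0)
    return mean, var


# Toy usage with random stand-ins for deep features.
rng = np.random.default_rng(0)
support_feats = rng.normal(size=(50, 8))
support_masks = (support_feats[:, 0] > 0).astype(float)
query_feats = np.vstack([support_feats[0], 100.0 * np.ones(8)])
mean, var = gp_posterior(support_feats, support_masks, query_feats)
```

In this toy setup the first query point coincides with a support location and the second lies far from every support feature, so the variance is lower for the first than for the second. This is the kind of uncertainty cue the abstract describes passing to the decoder alongside the mean.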

Results

Task	Dataset	Metric	Value	Model
Few-Shot Learning	COCO-20i (5-shot)	Mean IoU	57.9	DGPNet (ResNet-101)
Few-Shot Learning	COCO-20i (5-shot)	Mean IoU	56.2	DGPNet (ResNet-50)
Few-Shot Learning	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	70.1	DGPNet (ResNet-101)
Few-Shot Learning	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	68.9	DGPNet (ResNet-50)
Few-Shot Learning	PASCAL-5i (10-Shot)	Mean IoU	77.7	DGPNet (ResNet-101)
Few-Shot Learning	COCO-20i (10-shot)	Mean IoU	60.2	DGPNet (ResNet-101)
Few-Shot Learning	PASCAL-5i (1-Shot)	Mean IoU	64.8	DGPNet (ResNet-101)
Few-Shot Learning	PASCAL-5i (1-Shot)	Mean IoU	63.5	DGPNet (ResNet-50)
Few-Shot Learning	COCO-20i (1-shot)	Mean IoU	46.7	DGPNet (ResNet-101)
Few-Shot Learning	COCO-20i (1-shot)	Mean IoU	45.0	DGPNet (ResNet-50)
Few-Shot Learning	PASCAL-5i (5-Shot)	Mean IoU	75.4	DGPNet (ResNet-101)
Few-Shot Learning	PASCAL-5i (5-Shot)	Mean IoU	73.5	DGPNet (ResNet-50)
Few-Shot Learning	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	78.5	DGPNet (ResNet-101)
Few-Shot Learning	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	77.5	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	Mean IoU	57.9	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	COCO-20i (5-shot)	Mean IoU	56.2	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	70.1	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	68.9	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	PASCAL-5i (10-Shot)	Mean IoU	77.7	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	COCO-20i (10-shot)	Mean IoU	60.2	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	Mean IoU	64.8	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	PASCAL-5i (1-Shot)	Mean IoU	63.5	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	Mean IoU	46.7	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	COCO-20i (1-shot)	Mean IoU	45.0	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	Mean IoU	75.4	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	PASCAL-5i (5-Shot)	Mean IoU	73.5	DGPNet (ResNet-50)
Few-Shot Semantic Segmentation	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	78.5	DGPNet (ResNet-101)
Few-Shot Semantic Segmentation	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	77.5	DGPNet (ResNet-50)
Meta-Learning	COCO-20i (5-shot)	Mean IoU	57.9	DGPNet (ResNet-101)
Meta-Learning	COCO-20i (5-shot)	Mean IoU	56.2	DGPNet (ResNet-50)
Meta-Learning	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	70.1	DGPNet (ResNet-101)
Meta-Learning	COCO-20i -> Pascal VOC (1-shot)	Mean IoU	68.9	DGPNet (ResNet-50)
Meta-Learning	PASCAL-5i (10-Shot)	Mean IoU	77.7	DGPNet (ResNet-101)
Meta-Learning	COCO-20i (10-shot)	Mean IoU	60.2	DGPNet (ResNet-101)
Meta-Learning	PASCAL-5i (1-Shot)	Mean IoU	64.8	DGPNet (ResNet-101)
Meta-Learning	PASCAL-5i (1-Shot)	Mean IoU	63.5	DGPNet (ResNet-50)
Meta-Learning	COCO-20i (1-shot)	Mean IoU	46.7	DGPNet (ResNet-101)
Meta-Learning	COCO-20i (1-shot)	Mean IoU	45.0	DGPNet (ResNet-50)
Meta-Learning	PASCAL-5i (5-Shot)	Mean IoU	75.4	DGPNet (ResNet-101)
Meta-Learning	PASCAL-5i (5-Shot)	Mean IoU	73.5	DGPNet (ResNet-50)
Meta-Learning	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	78.5	DGPNet (ResNet-101)
Meta-Learning	COCO-20i -> Pascal VOC (5-shot)	Mean IoU	77.5	DGPNet (ResNet-50)
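
For readers unfamiliar with the metric, the sketch below shows one common way a Mean IoU score can be computed over few-shot episodes: intersections and unions of binary masks are accumulated per class, and the per-class IoUs are then averaged. The listing above does not state the exact evaluation protocol, so this convention, the function name `mean_iou`, and the percentage scaling are assumptions for illustration only.

```python
import numpy as np


def mean_iou(pred_masks, gt_masks, class_ids):
    """Class-averaged IoU (in percent) over a set of binary-mask episodes.

    pred_masks, gt_masks: sequences of arrays with the same shape per episode
    class_ids:            the class label of each episode
    """
    inter, union = {}, {}
    for pred, gt, c in zip(pred_masks, gt_masks, class_ids):
        pred = np.asarray(pred, dtype=bool)
        gt = np.asarray(gt, dtype=bool)
        # Accumulate pixel counts per class across episodes.
        inter[c] = inter.get(c, 0) + np.logical_and(pred, gt).sum()
        union[c] = union.get(c, 0) + np.logical_or(pred, gt).sum()
    # Skip classes whose accumulated union is empty to avoid division by zero.
    ious = [inter[c] / union[c] for c in inter if union[c] > 0]
    return 100.0 * float(np.mean(ious))


# Two toy episodes from two hypothetical classes.
preds = [np.array([[1, 1], [0, 0]]), np.array([[0, 1], [0, 1]])]
gts = [np.array([[1, 0], [0, 0]]), np.array([[0, 1], [0, 1]])]
score = mean_iou(preds, gts, class_ids=[0, 1])
```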
