TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Dense Gaussian Processes for Few-Shot Segmentation

Dense Gaussian Processes for Few-Shot Segmentation

Joakim Johnander, Johan Edstedt, Michael Felsberg, Fahad Shahbaz Khan, Martin Danelljan

2021-10-07Gaussian ProcessesSegmentationFew-Shot Semantic Segmentation
PaperPDFCode(official)

Abstract

Few-shot segmentation is a challenging dense prediction task, which entails segmenting a novel query image given only a small annotated support set. The key problem is thus to design a method that aggregates detailed information from the support set, while being robust to large variations in appearance and context. To this end, we propose a few-shot segmentation method based on dense Gaussian process (GP) regression. Given the support set, our dense GP learns the mapping from local deep image features to mask values, capable of capturing complex appearance distributions. Furthermore, it provides a principled means of capturing uncertainty, which serves as another powerful cue for the final segmentation, obtained by a CNN decoder. Instead of a one-dimensional mask output, we further exploit the end-to-end learning capabilities of our approach to learn a high-dimensional output space for the GP. Our approach sets a new state-of-the-art on the PASCAL-5$^i$ and COCO-20$^i$ benchmarks, achieving an absolute gain of $+8.4$ mIoU in the COCO-20$^i$ 5-shot setting. Furthermore, the segmentation quality of our approach scales gracefully when increasing the support set size, while achieving robust cross-dataset transfer. Code and trained models are available at \url{https://github.com/joakimjohnander/dgpnet}.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)Mean IoU57.9DGPNet (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU56.2DGPNet (ResNet-50)
Few-Shot LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU70.1DGPNet (ResNet-101)
Few-Shot LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU68.9DGPNet (ResNet-50)
Few-Shot LearningPASCAL-5i (10-Shot)Mean IoU77.7DGPNet (ResNet-101)
Few-Shot LearningCOCO-20i (10-shot)Mean IoU60.2DGPNet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU64.8DGPNet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU63.5DGPNet (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU46.7DGPNet (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU45DGPNet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU75.4DGPNet (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU73.5DGPNet (ResNet-50)
Few-Shot LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU78.5DGPNet (ResNet-101)
Few-Shot LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU77.5DGPNet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU57.9DGPNet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU56.2DGPNet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (1-shot)Mean IoU70.1DGPNet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (1-shot)Mean IoU68.9DGPNet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (10-Shot)Mean IoU77.7DGPNet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (10-shot)Mean IoU60.2DGPNet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU64.8DGPNet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU63.5DGPNet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU46.7DGPNet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU45DGPNet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU75.4DGPNet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU73.5DGPNet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (5-shot)Mean IoU78.5DGPNet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (5-shot)Mean IoU77.5DGPNet (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU57.9DGPNet (ResNet-101)
Meta-LearningCOCO-20i (5-shot)Mean IoU56.2DGPNet (ResNet-50)
Meta-LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU70.1DGPNet (ResNet-101)
Meta-LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU68.9DGPNet (ResNet-50)
Meta-LearningPASCAL-5i (10-Shot)Mean IoU77.7DGPNet (ResNet-101)
Meta-LearningCOCO-20i (10-shot)Mean IoU60.2DGPNet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU64.8DGPNet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU63.5DGPNet (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU46.7DGPNet (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU45DGPNet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU75.4DGPNet (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU73.5DGPNet (ResNet-50)
Meta-LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU78.5DGPNet (ResNet-101)
Meta-LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU77.5DGPNet (ResNet-50)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17