TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/PANet: Few-Shot Image Semantic Segmentation with Prototype...

PANet: Few-Shot Image Semantic Segmentation with Prototype Alignment

Kaixin Wang, Jun Hao Liew, Yingtian Zou, Daquan Zhou, Jiashi Feng

2019-08-18ICCV 2019 10Metric LearningSegmentationFew-Shot Semantic SegmentationSemantic Segmentation
PaperPDFCodeCodeCode(official)CodeCode

Abstract

Despite the great progress made by deep CNNs in image semantic segmentation, they typically require a large number of densely-annotated images for training and are difficult to generalize to unseen object categories. Few-shot segmentation has thus been developed to learn to perform segmentation from only a few annotated examples. In this paper, we tackle the challenging few-shot segmentation problem from a metric learning perspective and present PANet, a novel prototype alignment network to better utilize the information of the support set. Our PANet learns class-specific prototype representations from a few support images within an embedding space and then performs segmentation over the query images through matching each pixel to the learned prototypes. With non-parametric metric learning, PANet offers high-quality prototypes that are representative for each semantic class and meanwhile discriminative for different classes. Moreover, PANet introduces a prototype alignment regularization between support and query. With this, PANet fully exploits knowledge from the support and provides better generalization on few-shot segmentation. Significantly, our model achieves the mIoU score of 48.1% and 55.7% on PASCAL-5i for 1-shot and 5-shot settings respectively, surpassing the state-of-the-art method by 1.8% and 8.6%.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)FB-IoU63.5PANet (VGG-16)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU29.7PANet (VGG-16)
Few-Shot LearningCOCO-20i (2-way 1-shot)mIoU18PANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU66.5PANet (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU48.1PANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU59.2PANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU20.9PANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU70.7PANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU55.7PANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU63.5PANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU29.7PANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (2-way 1-shot)mIoU18PANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU66.5PANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU48.1PANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU59.2PANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU20.9PANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU70.7PANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU55.7PANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)FB-IoU63.5PANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)Mean IoU29.7PANet (VGG-16)
Meta-LearningCOCO-20i (2-way 1-shot)mIoU18PANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU66.5PANet (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU48.1PANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)FB-IoU59.2PANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)Mean IoU20.9PANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU70.7PANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU55.7PANet (VGG-16)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Unsupervised Ground Metric Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17