TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/APANet: Adaptive Prototypes Alignment Network for Few-Shot...

APANet: Adaptive Prototypes Alignment Network for Few-Shot Semantic Segmentation

Jiacheng Chen, Bin-Bin Gao, Zongqing Lu, Jing-Hao Xue, Chengjie Wang, Qingmin Liao

2021-11-24Metric LearningSegmentationFew-Shot Semantic SegmentationSemantic Segmentation
PaperPDF

Abstract

Few-shot semantic segmentation aims to segment novel-class objects in a given query image with only a few labeled support images. Most advanced solutions exploit a metric learning framework that performs segmentation through matching each query feature to a learned class-specific prototype. However, this framework suffers from biased classification due to incomplete feature comparisons. To address this issue, we present an adaptive prototype representation by introducing class-specific and class-agnostic prototypes and thus construct complete sample pairs for learning semantic alignment with query features. The complementary features learning manner effectively enriches feature comparison and helps yield an unbiased segmentation model in the few-shot setting. It is implemented with a two-branch end-to-end network (i.e., a class-specific branch and a class-agnostic branch), which generates prototypes and then combines query features to perform comparisons. In addition, the proposed class-agnostic branch is simple yet effective. In practice, it can adaptively generate multiple class-agnostic prototypes for query images and learn feature alignment in a self-contrastive manner. Extensive experiments on PASCAL-5$^i$ and COCO-20$^i$ demonstrate the superiority of our method. At no expense of inference efficiency, our model achieves state-of-the-art results in both 1-shot and 5-shot settings for semantic segmentation.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)Mean IoU46.4APANet (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU43.2APANet (VGG-16)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU43APANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU64APANet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU63APANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU59APANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU41.9APANet (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU40.5APANet (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU37.2APANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU68APANet (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU66APANet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU62.6APANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU46.4APANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU43.2APANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU43APANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU64APANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU63APANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU59APANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU41.9APANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU40.5APANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU37.2APANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU68APANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU66APANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU62.6APANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)Mean IoU46.4APANet (ResNet-101)
Meta-LearningCOCO-20i (5-shot)Mean IoU43.2APANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)Mean IoU43APANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU64APANet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU63APANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU59APANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)Mean IoU41.9APANet (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU40.5APANet (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU37.2APANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU68APANet (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU66APANet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU62.6APANet (VGG-16)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Unsupervised Ground Metric Learning2025-07-17Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17