TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Few-Shot Segmentation Without Meta-Learning: A Good Transd...

Few-Shot Segmentation Without Meta-Learning: A Good Transductive Inference Is All You Need?

Malik Boudiaf, Hoel Kervadec, Ziko Imtiaz Masud, Pablo Piantanida, Ismail Ben Ayed, Jose Dolz

2020-12-11CVPR 2021 1Few-Shot Semantic SegmentationAll
PaperPDFCodeCode(official)

Abstract

We show that the way inference is performed in few-shot segmentation tasks has a substantial effect on performances -- an aspect often overlooked in the literature in favor of the meta-learning paradigm. We introduce a transductive inference for a given query image, leveraging the statistics of its unlabeled pixels, by optimizing a new loss containing three complementary terms: i) the cross-entropy on the labeled support pixels; ii) the Shannon entropy of the posteriors on the unlabeled query-image pixels; and iii) a global KL-divergence regularizer based on the proportion of the predicted foreground. As our inference uses a simple linear classifier of the extracted features, its computational load is comparable to inductive inference and can be used on top of any base training. Foregoing episodic training and using only standard cross-entropy training on the base classes, our inference yields competitive performances on standard benchmarks in the 1-shot scenarios. As the number of available shots increases, the gap in performances widens: on PASCAL-5i, our method brings about 5% and 6% improvements over the state-of-the-art, in the 5- and 10-shot scenarios, respectively. Furthermore, we introduce a new setting that includes domain shifts, where the base and novel classes are drawn from different datasets. Our method achieves the best performances in this more realistic setting. Our code is freely available online: https://github.com/mboudiaf/RePRI-for-Few-Shot-Segmentation.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)Mean IoU41.6RePRI (ResNet-50)
Few-Shot LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU63.1RePRI (ResNet-50)
Few-Shot LearningPASCAL-5i (10-Shot)Mean IoU68.1RePRI (ResNet-50)
Few-Shot LearningCOCO-20i (10-shot)Mean IoU44.1RePRI (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU59.7RePRI (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU59.4RePRI (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU34.1RePRI (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU66.6RePRI (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU65.6RePRI (ResNet-101)
Few-Shot LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU67.7RePRI (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU41.6RePRI (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (1-shot)Mean IoU63.1RePRI (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (10-Shot)Mean IoU68.1RePRI (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (10-shot)Mean IoU44.1RePRI (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU59.7RePRI (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU59.4RePRI (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU34.1RePRI (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU66.6RePRI (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU65.6RePRI (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i -> Pascal VOC (5-shot)Mean IoU67.7RePRI (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU41.6RePRI (ResNet-50)
Meta-LearningCOCO-20i -> Pascal VOC (1-shot)Mean IoU63.1RePRI (ResNet-50)
Meta-LearningPASCAL-5i (10-Shot)Mean IoU68.1RePRI (ResNet-50)
Meta-LearningCOCO-20i (10-shot)Mean IoU44.1RePRI (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU59.7RePRI (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU59.4RePRI (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU34.1RePRI (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU66.6RePRI (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU65.6RePRI (ResNet-101)
Meta-LearningCOCO-20i -> Pascal VOC (5-shot)Mean IoU67.7RePRI (ResNet-50)

Related Papers

Modeling Code: Is Text All You Need?2025-07-15All Eyes, no IMU: Learning Flight Attitude from Vision Alone2025-07-15Is Diversity All You Need for Scalable Robotic Manipulation?2025-07-08DESIGN AND IMPLEMENTATION OF ONLINE CLEARANCE REPORT.2025-07-07Is Reasoning All You Need? Probing Bias in the Age of Reasoning Language Models2025-07-03Prompt2SegCXR:Prompt to Segment All Organs and Diseases in Chest X-rays2025-07-01State and Memory is All You Need for Robust and Reliable AI Agents2025-06-30EAMamba: Efficient All-Around Vision State Space Model for Image Restoration2025-06-27