TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Mining Latent Classes for Few-shot Segmentation

Mining Latent Classes for Few-shot Segmentation

Lihe Yang, Wei Zhuo, Lei Qi, Yinghuan Shi, Yang Gao

2021-03-29ICCV 2021 10Few-Shot Semantic Segmentation
PaperPDFCode(official)

Abstract

Few-shot segmentation (FSS) aims to segment unseen classes given only a few annotated samples. Existing methods suffer the problem of feature undermining, i.e. potential novel classes are treated as background during training phase. Our method aims to alleviate this problem and enhance the feature embedding on latent novel classes. In our work, we propose a novel joint-training framework. Based on conventional episodic training on support-query pairs, we add an additional mining branch that exploits latent novel classes via transferable sub-clusters, and a new rectification technique on both background and foreground categories to enforce more stable prototypes. Over and above that, our transferable sub-cluster has the ability to leverage extra unlabeled data for further feature enhancement. Extensive experiments on two FSS benchmarks demonstrate that our method outperforms previous state-of-the-art by a large margin of 3.7% mIOU on PASCAL-5i and 7.0% mIOU on COCO-20i at the cost of 74% fewer parameters and 2.5x faster inference speed. The source code is available at https://github.com/LiheYoung/MiningFSS.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)Mean IoU45.1MLC (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU41.4MLC (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU63.8MLC (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU63.6MLC (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU37.5MLC (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU35.1MLC (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU69.3MLC (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU66.8MLC (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU45.1MLC (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU41.4MLC (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU63.8MLC (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU63.6MLC (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU37.5MLC (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU35.1MLC (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU69.3MLC (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU66.8MLC (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU45.1MLC (ResNet-101)
Meta-LearningCOCO-20i (5-shot)Mean IoU41.4MLC (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU63.8MLC (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU63.6MLC (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU37.5MLC (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU35.1MLC (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU69.3MLC (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU66.8MLC (ResNet-50)

Related Papers

Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation2025-06-09DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining2025-04-22FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images2025-04-01Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models2025-02-03AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies2025-01-28Overcoming Support Dilution for Robust Few-shot Semantic Segmentation2025-01-23Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks2025-01-17DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation2025-01-01