TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

Weimin Tan, Siyuan Chen, Bo Yan

2023-07-03SegmentationFew-Shot Semantic SegmentationSemantic SegmentationImage Generation
PaperPDFCode(official)

Abstract

Diffusion models have demonstrated excellent performance in image generation. Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a bottleneck. This paper presents the first work to leverage the diffusion model for FSS task, called DifFSS. DifFSS, a novel FSS paradigm, can further improve the performance of the state-of-the-art FSS models by a large margin without modifying their network structure. Specifically, we utilize the powerful generation ability of diffusion models to generate diverse auxiliary support images by using the semantic mask, scribble or soft HED boundary of the support image as control conditions. This generation process simulates the variety within the class of the query image, such as color, texture variation, lighting, $etc$. As a result, FSS models can refer to more diverse support images, yielding more robust representations, thereby achieving a consistent improvement in segmentation performance. Extensive experiments on three publicly available datasets based on existing advanced FSS models demonstrate the effectiveness of the diffusion model for FSS task. Furthermore, we explore in detail the impact of different input settings of the diffusion model on segmentation performance. Hopefully, this completely new paradigm will bring inspiration to the study of FSS task integrated with AI-generated content. Code is available at https://github.com/TrinitialChan/DifFSS

Results

TaskDatasetMetricValueModel
Few-Shot LearningFSS-1000 (1-shot)Mean IoU88.4DCAMA (DifFSS, ResNet-50)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU86.2HSNet (DifFSS, ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU70.2HDMNet (DifFSS, ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU69.3BAM (DifFSS, ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU66.2CyCTR (DifFSS, ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU46.7HDMNet (DifFSS, ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU43.6BAM (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU88.4DCAMA (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU86.2HSNet (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU70.2HDMNet (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU69.3BAM (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU66.2CyCTR (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU46.7HDMNet (DifFSS, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU43.6BAM (DifFSS, ResNet-50)
Meta-LearningFSS-1000 (1-shot)Mean IoU88.4DCAMA (DifFSS, ResNet-50)
Meta-LearningFSS-1000 (1-shot)Mean IoU86.2HSNet (DifFSS, ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU70.2HDMNet (DifFSS, ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU69.3BAM (DifFSS, ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU66.2CyCTR (DifFSS, ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU46.7HDMNet (DifFSS, ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU43.6BAM (DifFSS, ResNet-50)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17