TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Masked Cross-image Encoding for Few-shot Segmentation

Masked Cross-image Encoding for Few-shot Segmentation

Wenbo Xu, Huaxi Huang, Ming Cheng, Litao Yu, Qiang Wu, Jian Zhang

2023-08-22Few-Shot Semantic Segmentation
PaperPDF

Abstract

Few-shot segmentation (FSS) is a dense prediction task that aims to infer the pixel-wise labels of unseen classes using only a limited number of annotated images. The key challenge in FSS is to classify the labels of query pixels using class prototypes learned from the few labeled support exemplars. Prior approaches to FSS have typically focused on learning class-wise descriptors independently from support images, thereby ignoring the rich contextual information and mutual dependencies among support-query features. To address this limitation, we propose a joint learning method termed Masked Cross-Image Encoding (MCE), which is designed to capture common visual properties that describe object details and to learn bidirectional inter-image dependencies that enhance feature interaction. MCE is more than a visual representation enrichment module; it also considers cross-image mutual dependencies and implicit guidance. Experiments on FSS benchmarks PASCAL-$5^i$ and COCO-$20^i$ demonstrate the advanced meta-learning ability of the proposed method.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)Mean IoU51.04MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU78.1MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU65.93MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU74.51MCE (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU62.87MCE (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU44.22MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU81.33MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU70.03MCE (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU78.2MCE (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU68.21MCE (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU51.04MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU78.1MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU65.93MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU74.51MCE (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU62.87MCE (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU44.22MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU81.33MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU70.03MCE (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU78.2MCE (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU68.21MCE (VGG-16)
Meta-LearningCOCO-20i (5-shot)Mean IoU51.04MCE (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU78.1MCE (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU65.93MCE (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU74.51MCE (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU62.87MCE (VGG-16)
Meta-LearningCOCO-20i (1-shot)Mean IoU44.22MCE (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU81.33MCE (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU70.03MCE (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU78.2MCE (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU68.21MCE (VGG-16)

Related Papers

Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation2025-06-09DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining2025-04-22FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images2025-04-01Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models2025-02-03AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies2025-01-28Overcoming Support Dilution for Robust Few-shot Semantic Segmentation2025-01-23Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks2025-01-17DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation2025-01-01