TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Self-Calibrated Cross Attention Network for Few-Shot Segme...

Self-Calibrated Cross Attention Network for Few-Shot Segmentation

Qianxiong Xu, Wenting Zhao, Guosheng Lin, Cheng Long

2023-08-18ICCV 2023 1Few-Shot Semantic Segmentation
PaperPDFCode(official)

Abstract

The key to the success of few-shot segmentation (FSS) lies in how to effectively utilize support samples. Most solutions compress support foreground (FG) features into prototypes, but lose some spatial details. Instead, others use cross attention to fuse query features with uncompressed support FG. Query FG could be fused with support FG, however, query background (BG) cannot find matched BG features in support FG, yet inevitably integrates dissimilar features. Besides, as both query FG and BG are combined with support FG, they get entangled, thereby leading to ineffective segmentation. To cope with these issues, we design a self-calibrated cross attention (SCCA) block. For efficient patch-based attention, query and support features are firstly split into patches. Then, we design a patch alignment module to align each query patch with its most similar support patch for better cross attention. Specifically, SCCA takes a query patch as Q, and groups the patches from the same query image and the aligned patches from the support image as K&V. In this way, the query BG features are fused with matched BG features (from query patches), and thus the aforementioned issues will be mitigated. Moreover, when calculating SCCA, we design a scaled-cosine mechanism to better utilize the support features for similarity calculation. Extensive experiments conducted on PASCAL-5^i and COCO-20^i demonstrate the superiority of our model, e.g., the mIoU score under 5-shot setting on COCO-20^i is 5.6%+ better than previous state-of-the-arts. The code is available at https://github.com/Sam1224/SCCAN.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)FB-IoU74.8SCCAN (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU57SCCAN (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)FB-IoU74.2SCCAN (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU53.9SCCAN (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU78.5SCCAN (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU68.3SCCAN (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU77.7SCCAN (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU66.8SCCAN (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU69.7SCCAN (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU48.2SCCAN (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU69.9SCCAN (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU46.3SCCAN (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU82.1SCCAN (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.5SCCAN (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU81.8SCCAN (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU70.3SCCAN (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU74.8SCCAN (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU57SCCAN (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU74.2SCCAN (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU53.9SCCAN (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU78.5SCCAN (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU68.3SCCAN (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU77.7SCCAN (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU66.8SCCAN (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU69.7SCCAN (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU48.2SCCAN (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU69.9SCCAN (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU46.3SCCAN (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU82.1SCCAN (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.5SCCAN (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU81.8SCCAN (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU70.3SCCAN (ResNet-50)
Meta-LearningCOCO-20i (5-shot)FB-IoU74.8SCCAN (ResNet-101)
Meta-LearningCOCO-20i (5-shot)Mean IoU57SCCAN (ResNet-101)
Meta-LearningCOCO-20i (5-shot)FB-IoU74.2SCCAN (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU53.9SCCAN (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU78.5SCCAN (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU68.3SCCAN (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU77.7SCCAN (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU66.8SCCAN (ResNet-50)
Meta-LearningCOCO-20i (1-shot)FB-IoU69.7SCCAN (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU48.2SCCAN (ResNet-101)
Meta-LearningCOCO-20i (1-shot)FB-IoU69.9SCCAN (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU46.3SCCAN (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU82.1SCCAN (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.5SCCAN (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU81.8SCCAN (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU70.3SCCAN (ResNet-50)

Related Papers

Adapter Naturally Serves as Decoupler for Cross-Domain Few-Shot Semantic Segmentation2025-06-09DINOv2-powered Few-Shot Semantic Segmentation: A Unified Framework via Cross-Model Distillation and 4D Correlation Mining2025-04-22FSSUWNet: Mitigating the Fragility of Pre-trained Models with Feature Enhancement for Few-Shot Semantic Segmentation in Underwater Images2025-04-01Exploring Few-Shot Defect Segmentation in General Industrial Scenarios with Metric Learning and Vision Foundation Models2025-02-03AdaSemSeg: An Adaptive Few-shot Semantic Segmentation of Seismic Facies2025-01-28Overcoming Support Dilution for Robust Few-shot Semantic Segmentation2025-01-23Few-shot Structure-Informed Machinery Part Segmentation with Foundation Models and Graph Neural Networks2025-01-17DSV-LFS: Unifying LLM-Driven Semantic Cues with Visual Features for Robust Few-Shot Segmentation2025-01-01