TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Doubly Deformable Aggregation of Covariance Matrices for F...

Doubly Deformable Aggregation of Covariance Matrices for Few-shot Segmentation

Zhitong Xiong, Haopeng Li, Xiao Xiang Zhu

2022-07-30Semantic correspondenceSegmentationFew-Shot Semantic SegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

Training semantic segmentation models with few annotated samples has great potential in various real-world applications. For the few-shot segmentation task, the main challenge is how to accurately measure the semantic correspondence between the support and query samples with limited training data. To address this problem, we propose to aggregate the learnable covariance matrices with a deformable 4D Transformer to effectively predict the segmentation map. Specifically, in this work, we first devise a novel hard example mining mechanism to learn covariance kernels for the Gaussian process. The learned covariance kernel functions have great advantages over existing cosine similarity-based methods in correspondence measurement. Based on the learned covariance kernels, an efficient doubly deformable 4D Transformer module is designed to adaptively aggregate feature similarity maps into segmentation results. By combining these two designs, the proposed method can not only set new state-of-the-art performance on public benchmarks, but also converge extremely faster than existing methods. Experiments on three public datasets have demonstrated the effectiveness of our method.

Results

TaskDatasetMetricValueModel
Few-Shot LearningFSS-1000 (5-shot)Mean IoU91.7DACM (ResNet-101)
Few-Shot LearningFSS-1000 (5-shot)Mean IoU91.6DACM (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)FB-IoU72.9DACM (VAT, ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU49.2DACM (VAT, ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)FB-IoU71.6DACM (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU48.1DACM (ResNet-50)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU90.8DACM (ResNet-101)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU90.7DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU79.4DACM (VAT, ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU69.1DACM (VAT, ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU78.9DACM (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU67.5DACM (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU78.6DACM (VAT, ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU66.8DACM (VAT, ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU77.8DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU65.7DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU75.5DACM (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU61.8DACM (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU69.4DACM (VAT, ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU43DACM (VAT, ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU68.9DACM (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU40.6DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU83.1DACM (VAT, ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU73.3DACM (VAT, ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU81.7DACM (VAT, ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.7DACM (VAT, ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU81.5DACM (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.4DACM (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU81.3DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU70.9DACM (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU77.8DACM (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU65.7DACM (VGG-16)
Few-Shot Semantic SegmentationFSS-1000 (5-shot)Mean IoU91.7DACM (ResNet-101)
Few-Shot Semantic SegmentationFSS-1000 (5-shot)Mean IoU91.6DACM (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU72.9DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU49.2DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU71.6DACM (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU48.1DACM (ResNet-50)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU90.8DACM (ResNet-101)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU90.7DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU79.4DACM (VAT, ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU69.1DACM (VAT, ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU78.9DACM (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU67.5DACM (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU78.6DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU66.8DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU77.8DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU65.7DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU75.5DACM (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU61.8DACM (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU69.4DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU43DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU68.9DACM (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU40.6DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU83.1DACM (VAT, ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU73.3DACM (VAT, ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU81.7DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.7DACM (VAT, ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU81.5DACM (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.4DACM (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU81.3DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU70.9DACM (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU77.8DACM (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU65.7DACM (VGG-16)
Meta-LearningFSS-1000 (5-shot)Mean IoU91.7DACM (ResNet-101)
Meta-LearningFSS-1000 (5-shot)Mean IoU91.6DACM (ResNet-50)
Meta-LearningCOCO-20i (5-shot)FB-IoU72.9DACM (VAT, ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU49.2DACM (VAT, ResNet-50)
Meta-LearningCOCO-20i (5-shot)FB-IoU71.6DACM (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU48.1DACM (ResNet-50)
Meta-LearningFSS-1000 (1-shot)Mean IoU90.8DACM (ResNet-101)
Meta-LearningFSS-1000 (1-shot)Mean IoU90.7DACM (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU79.4DACM (VAT, ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU69.1DACM (VAT, ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU78.9DACM (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU67.5DACM (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU78.6DACM (VAT, ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU66.8DACM (VAT, ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU77.8DACM (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU65.7DACM (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU75.5DACM (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU61.8DACM (VGG-16)
Meta-LearningCOCO-20i (1-shot)FB-IoU69.4DACM (VAT, ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU43DACM (VAT, ResNet-50)
Meta-LearningCOCO-20i (1-shot)FB-IoU68.9DACM (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU40.6DACM (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU83.1DACM (VAT, ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU73.3DACM (VAT, ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU81.7DACM (VAT, ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.7DACM (VAT, ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU81.5DACM (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.4DACM (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU81.3DACM (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU70.9DACM (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU77.8DACM (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU65.7DACM (VGG-16)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17