TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Cost Aggregation Is All You Need for Few-Shot Segmentation

Cost Aggregation Is All You Need for Few-Shot Segmentation

Sunghwan Hong, Seokju Cho, Jisu Nam, Seungryong Kim

2021-12-22Semantic correspondenceSegmentationFew-Shot Semantic SegmentationAll
PaperPDFCodeCode(official)

Abstract

We introduce a novel cost aggregation network, dubbed Volumetric Aggregation with Transformers (VAT), to tackle the few-shot segmentation task by using both convolutions and transformers to efficiently handle high dimensional correlation maps between query and support. In specific, we propose our encoder consisting of volume embedding module to not only transform the correlation maps into more tractable size but also inject some convolutional inductive bias and volumetric transformer module for the cost aggregation. Our encoder has a pyramidal structure to let the coarser level aggregation to guide the finer level and enforce to learn complementary matching scores. We then feed the output into our affinity-aware decoder along with the projected feature maps for guiding the segmentation process. Combining these components, we conduct experiments to demonstrate the effectiveness of the proposed method, and our method sets a new state-of-the-art for all the standard benchmarks in few-shot segmentation task. Furthermore, we find that the proposed method attains state-of-the-art performance even for the standard benchmarks in semantic correspondence task although not specifically designed for this task. We also provide an extensive ablation study to validate our architectural choices. The trained weights and codes are available at: https://seokju-cho.github.io/VAT/.

Results

TaskDatasetMetricValueModel
Few-Shot LearningFSS-1000 (5-shot)Mean IoU90.6VAT
Few-Shot LearningCOCO-20i (5-shot)Mean IoU47.9VAT (ResNet-50)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU90VAT
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU67.5VAT
Few-Shot LearningCOCO-20i (1-shot)Mean IoU41.3VAT (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.6VAT
Image MatchingSPair-71kPCK54.2VAT
Image MatchingPF-PASCALPCK92.3VAT
Image MatchingPF-WILLOWPCK81VAT
Few-Shot Semantic SegmentationFSS-1000 (5-shot)Mean IoU90.6VAT
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU47.9VAT (ResNet-50)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU90VAT
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU67.5VAT
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU41.3VAT (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.6VAT
Meta-LearningFSS-1000 (5-shot)Mean IoU90.6VAT
Meta-LearningCOCO-20i (5-shot)Mean IoU47.9VAT (ResNet-50)
Meta-LearningFSS-1000 (1-shot)Mean IoU90VAT
Meta-LearningPASCAL-5i (1-Shot)Mean IoU67.5VAT
Meta-LearningCOCO-20i (1-shot)Mean IoU41.3VAT (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.6VAT
Semantic correspondenceSPair-71kPCK54.2VAT
Semantic correspondencePF-PASCALPCK92.3VAT
Semantic correspondencePF-WILLOWPCK81VAT

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation2025-07-17Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17