TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MSANet: Multi-Similarity and Attention Guidance for Boosti...

MSANet: Multi-Similarity and Attention Guidance for Boosting Few-Shot Segmentation

Ehtesham Iqbal, Sirojbek Safarov, Seongdeok Bang

2022-06-20Meta-LearningFew-Shot Semantic SegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

Few-shot segmentation aims to segment unseen-class objects given only a handful of densely labeled samples. Prototype learning, where the support feature yields a singleor several prototypes by averaging global and local object information, has been widely used in FSS. However, utilizing only prototype vectors may be insufficient to represent the features for all training data. To extract abundant features and make more precise predictions, we propose a Multi-Similarity and Attention Network (MSANet) including two novel modules, a multi-similarity module and an attention module. The multi-similarity module exploits multiple feature-maps of support images and query images to estimate accurate semantic relationships. The attention module instructs the network to concentrate on class-relevant information. The network is tested on standard FSS datasets, PASCAL-5i 1-shot, PASCAL-5i 5-shot, COCO-20i 1-shot, and COCO-20i 5-shot. The MSANet with the backbone of ResNet-101 achieves the state-of-the-art performance for all 4-benchmark datasets with mean intersection over union (mIoU) of 69.13%, 73.99%, 51.09%, 56.80%, respectively. Code is available at https://github.com/AIVResearch/MSANet

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)FB-IoU56.8MSANet (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU56.3MSANet (ResNet-101)
Few-Shot LearningCOCO-20i (5-shot)FB-IoU53.67MSANet (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU50.47MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU80.38MSANet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU69.13MSANet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU80.44MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU68.52MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU78.01MSANet (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU65.76MSANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU51.09MSANet (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU50.45MSANet (ResNet-101)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU48.03MSANet (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU46.44MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU84.3MSANet (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU73.99MSANet (ResNet-101)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU83.23MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU72.6MSANet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU80.5MSANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU70.4MSANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU56.8MSANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU56.3MSANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU53.67MSANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU50.47MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU80.38MSANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU69.13MSANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU80.44MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU68.52MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU78.01MSANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU65.76MSANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU51.09MSANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU50.45MSANet (ResNet-101)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU48.03MSANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU46.44MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU84.3MSANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU73.99MSANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU83.23MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU72.6MSANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU80.5MSANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU70.4MSANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)FB-IoU56.8MSANet (ResNet-101)
Meta-LearningCOCO-20i (5-shot)Mean IoU56.3MSANet (ResNet-101)
Meta-LearningCOCO-20i (5-shot)FB-IoU53.67MSANet (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU50.47MSANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU80.38MSANet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU69.13MSANet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU80.44MSANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU68.52MSANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU78.01MSANet (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU65.76MSANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)FB-IoU51.09MSANet (ResNet-101)
Meta-LearningCOCO-20i (1-shot)Mean IoU50.45MSANet (ResNet-101)
Meta-LearningCOCO-20i (1-shot)FB-IoU48.03MSANet (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU46.44MSANet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU84.3MSANet (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU73.99MSANet (ResNet-101)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU83.23MSANet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU72.6MSANet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU80.5MSANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU70.4MSANet (VGG-16)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16