TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MIANet: Aggregating Unbiased Instance and General Informat...

MIANet: Aggregating Unbiased Instance and General Information for Few-Shot Semantic Segmentation

Yong Yang, Qiong Chen, Yuan Feng, Tianlin Huang

2023-05-23CVPR 2023 1Meta-LearningFew-Shot Semantic SegmentationSemantic SegmentationWord EmbeddingsGeneral Knowledge
PaperPDFCode(official)

Abstract

Existing few-shot segmentation methods are based on the meta-learning strategy and extract instance knowledge from a support set and then apply the knowledge to segment target objects in a query set. However, the extracted knowledge is insufficient to cope with the variable intra-class differences since the knowledge is obtained from a few samples in the support set. To address the problem, we propose a multi-information aggregation network (MIANet) that effectively leverages the general knowledge, i.e., semantic word embeddings, and instance information for accurate segmentation. Specifically, in MIANet, a general information module (GIM) is proposed to extract a general class prototype from word embeddings as a supplement to instance information. To this end, we design a triplet loss that treats the general class prototype as an anchor and samples positive-negative pairs from local features in the support set. The calculated triplet loss can transfer semantic similarities among language identities from a word embedding space to a visual representation space. To alleviate the model biasing towards the seen training classes and to obtain multi-scale information, we then introduce a non-parametric hierarchical prior module (HPM) to generate unbiased instance-level information via calculating the pixel-level similarity between the support and query image features. Finally, an information fusion module (IFM) combines the general and instance information to make predictions for the query image. Extensive experiments on PASCAL-5i and COCO-20i show that MIANet yields superior performance and set a new state-of-the-art. Code is available at https://github.com/Aldrich2y/MIANet.

Results

TaskDatasetMetricValueModel
Few-Shot LearningCOCO-20i (5-shot)FB-IoU73.13MIANet (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU51.65MIANet (ResNet-50)
Few-Shot LearningCOCO-20i (5-shot)FB-IoU73.81MIANet (VGG-16)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU51.03MIANet (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU79.54MIANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU68.72MIANet (ResNet-50)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU67.63MIANet (ResNet-101)
Few-Shot LearningPASCAL-5i (1-Shot)FB-IoU79.22MIANet (VGG-16)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU67.1MIANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU71.51MIANet (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU47.66MIANet (ResNet-50)
Few-Shot LearningCOCO-20i (1-shot)FB-IoU71.01MIANet (VGG-16)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU45.69MIANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU82.69MIANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.99MIANet (VGG-16)
Few-Shot LearningPASCAL-5i (5-Shot)FB-IoU82.2MIANet (ResNet-50)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU71.59MIANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU73.13MIANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU51.65MIANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)FB-IoU73.81MIANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU51.03MIANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU79.54MIANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU68.72MIANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU67.63MIANet (ResNet-101)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)FB-IoU79.22MIANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU67.1MIANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU71.51MIANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU47.66MIANet (ResNet-50)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)FB-IoU71.01MIANet (VGG-16)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU45.69MIANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU82.69MIANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.99MIANet (VGG-16)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)FB-IoU82.2MIANet (ResNet-50)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU71.59MIANet (ResNet-50)
Meta-LearningCOCO-20i (5-shot)FB-IoU73.13MIANet (ResNet-50)
Meta-LearningCOCO-20i (5-shot)Mean IoU51.65MIANet (ResNet-50)
Meta-LearningCOCO-20i (5-shot)FB-IoU73.81MIANet (VGG-16)
Meta-LearningCOCO-20i (5-shot)Mean IoU51.03MIANet (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU79.54MIANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU68.72MIANet (ResNet-50)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU67.63MIANet (ResNet-101)
Meta-LearningPASCAL-5i (1-Shot)FB-IoU79.22MIANet (VGG-16)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU67.1MIANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)FB-IoU71.51MIANet (ResNet-50)
Meta-LearningCOCO-20i (1-shot)Mean IoU47.66MIANet (ResNet-50)
Meta-LearningCOCO-20i (1-shot)FB-IoU71.01MIANet (VGG-16)
Meta-LearningCOCO-20i (1-shot)Mean IoU45.69MIANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU82.69MIANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.99MIANet (VGG-16)
Meta-LearningPASCAL-5i (5-Shot)FB-IoU82.2MIANet (ResNet-50)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU71.59MIANet (ResNet-50)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17Are encoders able to learn landmarkers for warm-starting of Hyperparameter Optimization?2025-07-16Imbalanced Regression Pipeline Recommendation2025-07-16CLID-MU: Cross-Layer Information Divergence Based Meta Update Strategy for Learning with Noisy Labels2025-07-16