TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Bridge the Points: Graph-based Few-shot Segment Anything S...

Bridge the Points: Graph-based Few-shot Segment Anything Semantically

Anqi Zhang, Guangyu Gao, Jianbo Jiao, Chi Harold Liu, Yunchao Wei

2024-10-09Few-Shot Semantic SegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

The recent advancements in large-scale pre-training techniques have significantly enhanced the capabilities of vision foundation models, notably the Segment Anything Model (SAM), which can generate precise masks based on point and box prompts. Recent studies extend SAM to Few-shot Semantic Segmentation (FSS), focusing on prompt generation for SAM-based automatic semantic segmentation. However, these methods struggle with selecting suitable prompts, require specific hyperparameter settings for different scenarios, and experience prolonged one-shot inference times due to the overuse of SAM, resulting in low efficiency and limited automation ability. To address these issues, we propose a simple yet effective approach based on graph analysis. In particular, a Positive-Negative Alignment module dynamically selects the point prompts for generating masks, especially uncovering the potential of the background context as the negative reference. Another subsequent Point-Mask Clustering module aligns the granularity of masks and selected points as a directed graph, based on mask coverage over points. These points are then aggregated by decomposing the weakly connected components of the directed graph in an efficient manner, constructing distinct natural clusters. Finally, the positive and overshooting gating, benefiting from graph-based granularity alignment, aggregate high-confident masks and filter out the false-positive masks for final prediction, reducing the usage of additional hyperparameters and redundant mask generation. Extensive experimental analysis across standard FSS, One-shot Part Segmentation, and Cross Domain FSS datasets validate the effectiveness and efficiency of the proposed approach, surpassing state-of-the-art generalist models with a mIoU of 58.7% on COCO-20i and 35.2% on LVIS-92i. The code is available in https://andyzaq.github.io/GF-SAM/.

Results

TaskDatasetMetricValueModel
Few-Shot LearningFSS-1000 (5-shot)Mean IoU88.9GF-SAM (DINOv2)
Few-Shot LearningCOCO-20i (5-shot)Mean IoU66.8GF-SAM (DINOv2)
Few-Shot LearningFSS-1000 (1-shot)Mean IoU88GF-SAM (DINOv2)
Few-Shot LearningPASCAL-5i (1-Shot)Mean IoU72.1GF-SAM (DINOv2)
Few-Shot LearningCOCO-20i (1-shot)Mean IoU58.7GF-SAM (DINOv2)
Few-Shot LearningPASCAL-5i (5-Shot)Mean IoU82.6GF-SAM (DINOv2)
Few-Shot Semantic SegmentationFSS-1000 (5-shot)Mean IoU88.9GF-SAM (DINOv2)
Few-Shot Semantic SegmentationCOCO-20i (5-shot)Mean IoU66.8GF-SAM (DINOv2)
Few-Shot Semantic SegmentationFSS-1000 (1-shot)Mean IoU88GF-SAM (DINOv2)
Few-Shot Semantic SegmentationPASCAL-5i (1-Shot)Mean IoU72.1GF-SAM (DINOv2)
Few-Shot Semantic SegmentationCOCO-20i (1-shot)Mean IoU58.7GF-SAM (DINOv2)
Few-Shot Semantic SegmentationPASCAL-5i (5-Shot)Mean IoU82.6GF-SAM (DINOv2)
Meta-LearningFSS-1000 (5-shot)Mean IoU88.9GF-SAM (DINOv2)
Meta-LearningCOCO-20i (5-shot)Mean IoU66.8GF-SAM (DINOv2)
Meta-LearningFSS-1000 (1-shot)Mean IoU88GF-SAM (DINOv2)
Meta-LearningPASCAL-5i (1-Shot)Mean IoU72.1GF-SAM (DINOv2)
Meta-LearningCOCO-20i (1-shot)Mean IoU58.7GF-SAM (DINOv2)
Meta-LearningPASCAL-5i (5-Shot)Mean IoU82.6GF-SAM (DINOv2)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15