Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

WeakSAM: Segment Anything Meets Weakly-supervised Instance-level Recognition

Lianghui Zhu, Junwei Zhou, Yan Liu, Xin Hao, Wenyu Liu, Xinggang Wang

2024-02-22 | Tasks: Weakly Supervised Object Detection, Segmentation, World Knowledge, Image-level Supervised Instance Segmentation, Object Detection

Abstract

Weakly supervised visual recognition using inexact supervision is a critical yet challenging learning problem. It significantly reduces human labeling costs and traditionally relies on multi-instance learning and pseudo-labeling. This paper introduces WeakSAM, which tackles weakly-supervised object detection (WSOD) and segmentation by utilizing the pre-learned world knowledge contained in a vision foundation model, i.e., the Segment Anything Model (SAM). WeakSAM addresses two critical limitations in traditional WSOD retraining, i.e., pseudo ground truth (PGT) incompleteness and noisy PGT instances, through adaptive PGT generation and Region of Interest (RoI) drop regularization. It also addresses SAM's problems of requiring prompts and category unawareness for automatic object detection and segmentation. Our results indicate that WeakSAM surpasses previous state-of-the-art methods on WSOD and weakly-supervised instance segmentation (WSIS) benchmarks by large margins, i.e., average improvements of 7.4% and 8.5%, respectively. The code is available at https://github.com/hustvl/WeakSAM.

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Object Detection | MS-COCO-2014 | AP | 26.6 | WeakSAM-MIST-DINO (with SAM) |
| Object Detection | MS-COCO-2014 | AP | 24.9 | WeakSAM-OICR-DINO (with SAM) |
| Object Detection | MS-COCO-2014 | AP | 23.8 | WeakSAM-MIST-Faster RCNN (with SAM) |
| Object Detection | MS-COCO-2014 | AP | 22.9 | WeakSAM-MIST (with SAM) |
| Object Detection | MS-COCO-2014 | AP | 22.3 | WeakSAM-OICR-Faster RCNN (with SAM) |
| Object Detection | MS-COCO-2014 | AP | 19.9 | WeakSAM-OICR (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 73.4 | WeakSAM-MIST-DINO (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 71.8 | WeakSAM-MIST-Faster RCNN (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 67.4 | WeakSAM-MIST (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 66.1 | WeakSAM-OICR-DINO (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 65.7 | WeakSAM-OICR-Faster RCNN (with SAM) |
| Object Detection | PASCAL VOC 2007 | mAP | 58.9 | WeakSAM-OICR (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 70.2 | WeakSAM-MIST-DINO (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 69.2 | WeakSAM-MIST-Faster RCNN (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 66.9 | WeakSAM-MIST (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 63.7 | WeakSAM-OICR-DINO (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 62.9 | WeakSAM-OICR-Faster RCNN (with SAM) |
| Object Detection | PASCAL VOC 2012 test | mAP | 58.4 | WeakSAM-OICR (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.25 | 73.4 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.5 | 64.4 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.7 | 49.7 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.75 | 45.3 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.25 | 70.3 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.5 | 59.6 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.7 | 43.1 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | PASCAL VOC 2012 val | mAP@0.75 | 36.2 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO 2017 val | AP | 25.2 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO 2017 val | AP@50 | 38.4 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO 2017 val | AP@75 | 27 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO 2017 val | AP | 20.6 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO 2017 val | AP@50 | 33.9 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO 2017 val | AP@75 | 22 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO test-dev | AP | 25.9 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO test-dev | AP@50 | 39.9 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO test-dev | AP@75 | 27.9 | WeakSAM-Mask2Former (with SAM) |
| Instance Segmentation | COCO test-dev | AP | 21 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO test-dev | AP@50 | 34.5 | WeakSAM-Mask RCNN (with SAM) |
| Instance Segmentation | COCO test-dev | AP@75 | 22.2 | WeakSAM-Mask RCNN (with SAM) |
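
The thresholded instance segmentation metrics listed above (mAP@0.25, mAP@0.5, mAP@0.7, mAP@0.75) differ in how much mask overlap a prediction needs before it counts as a correct match. The sketch below illustrates only that matching step for a single predicted mask and a single ground-truth mask. It is not the WeakSAM evaluation code and it does not compute a full mAP, which additionally ranks predictions by confidence and averages precision over recall and classes. The function names `mask_iou` and `matches_at` and the toy masks are hypothetical, chosen for illustration.

```python
def mask_iou(pred, gt):
    """Intersection-over-union of two binary masks.

    Both masks are equally sized nested lists of 0/1 values (an assumed
    toy representation; real pipelines typically use arrays).
    """
    inter = 0
    union = 0
    for pred_row, gt_row in zip(pred, gt):
        for p, g in zip(pred_row, gt_row):
            if p and g:
                inter += 1
            if p or g:
                union += 1
    # Two empty masks have no overlap to measure; treat that as IoU 0.
    return inter / union if union else 0.0


def matches_at(pred, gt, thresholds=(0.25, 0.5, 0.7, 0.75)):
    """Report, per IoU threshold, whether pred would count as a match for gt."""
    iou = mask_iou(pred, gt)
    return {t: iou >= t for t in thresholds}


# Toy 4x4 masks: each covers 8 pixels, and they share 6 of them.
gt = [
    [1, 1, 1, 1],
    [1, 1, 1, 1],
    [0, 0, 0, 0],
    [0, 0, 0, 0],
]
pred = [
    [1, 1, 1, 1],
    [1, 1, 0, 0],
    [1, 1, 0, 0],
    [0, 0, 0, 0],
]

if __name__ == "__main__":
    print(mask_iou(pred, gt))    # intersection 6, union 10
    print(matches_at(pred, gt))  # match at 0.25 and 0.5, not at 0.7 or 0.75
```

The same prediction passes the looser thresholds and fails the stricter ones, which is why scores fall as the threshold rises within each model's rows.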

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction (2025-07-21)
Deep Learning-Based Fetal Lung Segmentation from Diffusion-weighted MRI Images and Lung Maturity Evaluation for Fetal Growth Restriction (2025-07-17)
DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model (2025-07-17)
From Variability To Accuracy: Conditional Bernoulli Diffusion Models with Consensus-Driven Correction for Thin Structure Segmentation (2025-07-17)
Unleashing Vision Foundation Models for Coronary Artery Segmentation: Parallel ViT-CNN Encoding and Variational Fusion (2025-07-17)
SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation (2025-07-17)
Unified Medical Image Segmentation with State Space Modeling Snake (2025-07-17)
A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique (2025-07-17)