TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/UniMatch V2: Pushing the Limit of Semi-Supervised Semantic...

UniMatch V2: Pushing the Limit of Semi-Supervised Semantic Segmentation

Lihe Yang, Zhen Zhao, Hengshuang Zhao

2024-10-14Semi-supervised Change DetectionSemi-Supervised Semantic SegmentationSemantic Segmentation
PaperPDFCode(official)

Abstract

Semi-supervised semantic segmentation (SSS) aims at learning rich visual knowledge from cheap unlabeled images to enhance semantic segmentation capability. Among recent works, UniMatch improves its precedents tremendously by amplifying the practice of weak-to-strong consistency regularization. Subsequent works typically follow similar pipelines and propose various delicate designs. Despite the achieved progress, strangely, even in this flourishing era of numerous powerful vision models, almost all SSS works are still sticking to 1) using outdated ResNet encoders with small-scale ImageNet-1K pre-training, and 2) evaluation on simple Pascal and Cityscapes datasets. In this work, we argue that, it is necessary to switch the baseline of SSS from ResNet-based encoders to more capable ViT-based encoders (e.g., DINOv2) that are pre-trained on massive data. A simple update on the encoder (even using 2x fewer parameters) can bring more significant improvement than careful method designs. Built on this competitive baseline, we present our upgraded and simplified UniMatch V2, inheriting the core spirit of weak-to-strong consistency from V1, but requiring less training cost and providing consistently better results. Additionally, witnessing the gradually saturated performance on Pascal and Cityscapes, we appeal that we should focus on more challenging benchmarks with complex taxonomy, such as ADE20K and COCO datasets. Code, models, and logs of all reported values, are available at https://github.com/LiheYoung/UniMatch-V2.

Results

TaskDatasetMetricValueModel
Semantic SegmentationCOCO 1/512 labeledValidation mIoU47.9UniMatch V2
Semantic SegmentationCOCO 1/256 labeledValidation mIoU55.8UniMatch V2
Semantic SegmentationADE20K 1/16 labeledValidation mIoU46.7UniMatch V2
Semantic SegmentationPASCAL VOC 2012 92 labeledValidation mIoU86.3UniMatch V2 (DINOv2-B)
Semantic SegmentationADE20K 1/32 labeledValidation mIoU45UniMatch V2
Semantic SegmentationPASCAL VOC 2012 732 labeledValidation mIoU90UniMatch V2 (DINOv2-B)
Semantic SegmentationPASCAL VOC 2012 1464 labelsValidation mIoU90.8UniMatch V2 (DINOv2-B)
Semantic SegmentationCOCO 1/128 labeledValidation mIoU58.7UniMatch V2
Semantic SegmentationCOCO 1/64 labeledValidation mIoU60.4UniMatch V2
Semantic SegmentationPASCAL VOC 2012 366 labeledValidation mIoU88.9UniMatch V2 (DINOv2-B)
Semantic SegmentationCityscapes 6.25% labeledValidation mIoU83.6UniMatch V2 (DINOv2-B)
Semantic SegmentationCOCO 1/32 labeledValidation mIoU63.3UniMatch V2
Semantic SegmentationPASCAL VOC 2012 183 labeledValidation mIoU87.9UniMatch V2 (DINOv2-B)
Change DetectionWHU - 20% labeled dataIoU87.9UniMatch V2
Change DetectionWHU - 20% labeled dataOA99.5UniMatch V2
Change DetectionWHU - 40% labeled dataIoU88.6UniMatch V2
Change DetectionWHU - 40% labeled dataOA99.52UniMatch V2
Change DetectionLEVIR-CD - 10% labeled dataIoU83.8UniMatch V2
Change DetectionLEVIR-CD - 10% labeled dataOA99.11UniMatch V2
Change DetectionLEVIR-CD - 5% labeled dataIoU83.3UniMatch V2
Change DetectionLEVIR-CD - 5% labeled dataOA99.08UniMatch V2
Change DetectionLEVIR-CD - 20% labeled dataIoU84.3UniMatch V2
Change DetectionLEVIR-CD - 20% labeled dataOA99.14UniMatch V2
Change DetectionLEVIR-CD - 40% labeled dataIoU84.3UniMatch V2
Change DetectionLEVIR-CD - 40% labeled dataOA99.14UniMatch V2
10-shot image generationCOCO 1/512 labeledValidation mIoU47.9UniMatch V2
10-shot image generationCOCO 1/256 labeledValidation mIoU55.8UniMatch V2
10-shot image generationADE20K 1/16 labeledValidation mIoU46.7UniMatch V2
10-shot image generationPASCAL VOC 2012 92 labeledValidation mIoU86.3UniMatch V2 (DINOv2-B)
10-shot image generationADE20K 1/32 labeledValidation mIoU45UniMatch V2
10-shot image generationPASCAL VOC 2012 732 labeledValidation mIoU90UniMatch V2 (DINOv2-B)
10-shot image generationPASCAL VOC 2012 1464 labelsValidation mIoU90.8UniMatch V2 (DINOv2-B)
10-shot image generationCOCO 1/128 labeledValidation mIoU58.7UniMatch V2
10-shot image generationCOCO 1/64 labeledValidation mIoU60.4UniMatch V2
10-shot image generationPASCAL VOC 2012 366 labeledValidation mIoU88.9UniMatch V2 (DINOv2-B)
10-shot image generationCityscapes 6.25% labeledValidation mIoU83.6UniMatch V2 (DINOv2-B)
10-shot image generationCOCO 1/32 labeledValidation mIoU63.3UniMatch V2
10-shot image generationPASCAL VOC 2012 183 labeledValidation mIoU87.9UniMatch V2 (DINOv2-B)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17SAMST: A Transformer framework based on SAM pseudo label filtering for remote sensing semi-supervised semantic segmentation2025-07-16Tomato Multi-Angle Multi-Pose Dataset for Fine-Grained Phenotyping2025-07-15U-RWKV: Lightweight medical image segmentation with direction-adaptive RWKV2025-07-15