TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Towards Sustainable Self-supervised Learning

Towards Sustainable Self-supervised Learning

ShangHua Gao, Pan Zhou, Ming-Ming Cheng, Shuicheng Yan

2022-10-20Self-Supervised Image ClassificationSelf-Supervised LearningSemantic SegmentationObject Detection
PaperPDFCode(official)

Abstract

Although increasingly training-expensive, most self-supervised learning (SSL) models have repeatedly been trained from scratch but not fully utilized, since only a few SOTAs are employed for downstream tasks. In this work, we explore a sustainable SSL framework with two major challenges: i) learning a stronger new SSL model based on the existing pretrained SSL model, also called as "base" model, in a cost-friendly manner, ii) allowing the training of the new model to be compatible with various base models. We propose a Target-Enhanced Conditional (TEC) scheme which introduces two components to the existing mask-reconstruction based SSL. Firstly, we propose patch-relation enhanced targets which enhances the target given by base model and encourages the new model to learn semantic-relation knowledge from the base model by using incomplete inputs. This hardening and target-enhancing help the new model surpass the base model, since they enforce additional patch relation modeling to handle incomplete input. Secondly, we introduce a conditional adapter that adaptively adjusts new model prediction to align with the target of different base models. Extensive experimental results show that our TEC scheme can accelerate the learning speed, and also improve SOTA SSL base models, e.g., MAE and iBOT, taking an explorative step towards sustainable SSL.

Results

TaskDatasetMetricValueModel
Semantic SegmentationImageNet-SmIoU (test)62.5TEC (ViT-B/16, 224x224, SSL+FT, mmseg)
Semantic SegmentationImageNet-SmIoU (val)63.2TEC (ViT-B/16, 224x224, SSL+FT, mmseg)
Semantic SegmentationImageNet-SmIoU (val)62TEC (ViT-B/16, 224x224, SSL+FT)
Semantic SegmentationImageNet-SmIoU (test)46TEC (ViT-B/16, 224x224, SSL, mmseg)
Semantic SegmentationImageNet-SmIoU (val)46.1TEC (ViT-B/16, 224x224, SSL, mmseg)
Semantic SegmentationImageNet-SmIoU (val)42.9TEC (ViT-B/16, 224x224, SSL)
Semantic SegmentationADE20KValidation mIoU51TEC (Vit-B, Upernet)
Object DetectionCOCO minivalbox AP54.6TEC(VIT-B, Mask-RCNN)
3DCOCO minivalbox AP54.6TEC(VIT-B, Mask-RCNN)
2D ClassificationCOCO minivalbox AP54.6TEC(VIT-B, Mask-RCNN)
2D Object DetectionCOCO minivalbox AP54.6TEC(VIT-B, Mask-RCNN)
10-shot image generationImageNet-SmIoU (test)62.5TEC (ViT-B/16, 224x224, SSL+FT, mmseg)
10-shot image generationImageNet-SmIoU (val)63.2TEC (ViT-B/16, 224x224, SSL+FT, mmseg)
10-shot image generationImageNet-SmIoU (val)62TEC (ViT-B/16, 224x224, SSL+FT)
10-shot image generationImageNet-SmIoU (test)46TEC (ViT-B/16, 224x224, SSL, mmseg)
10-shot image generationImageNet-SmIoU (val)46.1TEC (ViT-B/16, 224x224, SSL, mmseg)
10-shot image generationImageNet-SmIoU (val)42.9TEC (ViT-B/16, 224x224, SSL)
10-shot image generationADE20KValidation mIoU51TEC (Vit-B, Upernet)
16kCOCO minivalbox AP54.6TEC(VIT-B, Mask-RCNN)

Related Papers

SeC: Advancing Complex Video Object Segmentation via Progressive Concept Construction2025-07-21A Semi-Supervised Learning Method for the Identification of Bad Exposures in Large Imaging Surveys2025-07-17DiffOSeg: Omni Medical Image Segmentation via Multi-Expert Collaboration Diffusion Model2025-07-17SCORE: Scene Context Matters in Open-Vocabulary Remote Sensing Instance Segmentation2025-07-17Unified Medical Image Segmentation with State Space Modeling Snake2025-07-17A Privacy-Preserving Semantic-Segmentation Method Using Domain-Adaptation Technique2025-07-17A Real-Time System for Egocentric Hand-Object Interaction Detection in Industrial Domains2025-07-17RS-TinyNet: Stage-wise Feature Fusion Network for Detecting Tiny Objects in Remote Sensing Images2025-07-17