TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/SSDS

SSDS

Self-Supervised Deep Supervision

GeneralIntroduced 200010 papers
Source Paper

Description

The method exploits the finding that high correlation of segmentation performance among each U-Net's decoder layer -- with discriminative layer attached -- tends to have higher segmentation performance in the final segmentation map. By introducing an "Inter-layer Divergence Loss", based on Kulback-Liebler Divergence, to promotes the consistency between each discriminative output from decoder layers by minimizing the divergence.

If we assume that each decoder layer is equivalent to PDE function parameterized by weight parameter θ\thetaθ:

Decoderi(x;θi)≡PDE(x;θi)Decoder_i(x;\theta_i) \equiv PDE(x;\theta_i)Decoderi​(x;θi​)≡PDE(x;θi​)

Then our objective is trying to make each discriminative output similar to each other:

PDE(x;θd)∼PDE(x;θi); 0≤i<dPDE(x; \theta_d) \sim PDE(x; \theta_i);\text{ } 0 \leq i < dPDE(x;θd​)∼PDE(x;θi​); 0≤i<d

Hence the objective is to minimize∑i=0dDKL(y^∣∣Decoderi)\text{minimize} \sum_{i=0}^{d} D_{KL}(\hat{y} || Decoder_i)minimize∑i=0d​DKL​(y^​∣∣Decoderi​).

Papers Using This Method

Cost-Efficient LLM Training with Lifetime-Aware Tensor Offloading via GPUDirect Storage2025-06-06Harnessing Your DRAM and SSD for Sustainable and Accessible LLM Inference with Mixed-Precision and Multi-level Caching2024-10-17FusionANNS: An Efficient CPU/GPU Cooperative Processing Architecture for Billion-scale Approximate Nearest Neighbor Search2024-09-25InstInfer: In-Storage Attention Offloading for Cost-Effective Long-Context LLM Inference2024-09-08LearnedKV: Integrating LSM and Learned Index for Superior Performance on Storage2024-06-27FastPersist: Accelerating Model Checkpointing in Deep Learning2024-06-19Automatic Speech Recognition (ASR) for the Diagnosis of pronunciation of Speech Sound Disorders in Korean children2024-03-13Semi-Supervised Dialogue Abstractive Summarization via High-Quality Pseudolabel Selection2024-03-06ESPN: Memory-Efficient Multi-Vector Information Retrieval2023-12-09OCTAve: 2D en face Optical Coherence Tomography Angiography Vessel Segmentation in Weakly-Supervised Learning with Locality Augmentation2022-07-25