TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo...

In Defense of Pseudo-Labeling: An Uncertainty-Aware Pseudo-label Selection Framework for Semi-Supervised Learning

Mamshad Nayeem Rizve, Kevin Duarte, Yogesh S Rawat, Mubarak Shah

2021-01-15ICLR 2021 1Semi-Supervised Video ClassificationSemi-supervised Medical Image ClassificationMulti-Label ClassificationSemi-Supervised Image Classification
PaperPDFCode(official)Code

Abstract

The recent research in semi-supervised learning (SSL) is mostly dominated by consistency regularization based methods which achieve strong performance. However, they heavily rely on domain-specific data augmentations, which are not easy to generate for all data modalities. Pseudo-labeling (PL) is a general SSL approach that does not have this constraint but performs relatively poorly in its original formulation. We argue that PL underperforms due to the erroneous high confidence predictions from poorly calibrated models; these predictions generate many incorrect pseudo-labels, leading to noisy training. We propose an uncertainty-aware pseudo-label selection (UPS) framework which improves pseudo labeling accuracy by drastically reducing the amount of noise encountered in the training process. Furthermore, UPS generalizes the pseudo-labeling process, allowing for the creation of negative pseudo-labels; these negative pseudo-labels can be used for multi-label classification as well as negative learning to improve the single-label classification. We achieve strong performance when compared to recent SSL methods on the CIFAR-10 and CIFAR-100 datasets. Also, we demonstrate the versatility of our method on the video dataset UCF-101 and the multi-label dataset Pascal VOC.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10, 4000 LabelsPercentage error4.86UPS (Shake-Shake)
Image ClassificationCIFAR-10, 1000 LabelsAccuracy91.82UPS (CNN-13)
Image Classificationcifar-100, 10000 LabelsPercentage error32UPS (CNN-13)
Image ClassificationCIFAR-100, 4000 LabelsAccuracy59.23UPS (CNN-13)
ClassificationChest X-Ray14 2% labeledAUC65.51UPS
Medical Image ClassificationChest X-Ray14 2% labeledAUC65.51UPS
Semi-Supervised Image ClassificationCIFAR-10, 4000 LabelsPercentage error4.86UPS (Shake-Shake)
Semi-Supervised Image ClassificationCIFAR-10, 1000 LabelsAccuracy91.82UPS (CNN-13)
Semi-Supervised Image Classificationcifar-100, 10000 LabelsPercentage error32UPS (CNN-13)
Semi-Supervised Image ClassificationCIFAR-100, 4000 LabelsAccuracy59.23UPS (CNN-13)

Related Papers

Privacy-Preserving Chest X-ray Classification in Latent Space with Homomorphically Encrypted Neural Inference2025-06-18Explainable Detection of Implicit Influential Patterns in Conversations via Data Augmentation2025-06-17AgriPotential: A Novel Multi-Spectral and Multi-Temporal Remote Sensing Dataset for Agricultural Potentials2025-06-13MUDAS: Mote-scale Unsupervised Domain Adaptation in Multi-label Sound Classification2025-06-12ToxSyn-PT: A Large-Scale Synthetic Dataset for Hate Speech Detection in Portuguese2025-06-11Single GPU Task Adaptation of Pathology Foundation Models for Whole Slide Image Analysis2025-06-05ViTSGMM: A Robust Semi-Supervised Image Recognition Network Using Sparse Labels2025-06-04PatchDEMUX: A Certifiably Robust Framework for Multi-label Classifiers Against Adversarial Patches2025-05-30