TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Debiased Learning from Naturally Imbalanced Pseudo-Labels

Debiased Learning from Naturally Imbalanced Pseudo-Labels

Xudong Wang, Zhirong Wu, Long Lian, Stella X. Yu

2022-01-05CVPR 2022 1Counterfactual ReasoningFew-Shot Image Classificationimbalanced classificationZero-Shot LearningSemi-Supervised Image Classification
PaperPDFCode(official)

Abstract

Pseudo-labels are confident predictions made on unlabeled target data by a classifier trained on labeled source data. They are widely used for adapting a model to unlabeled data, e.g., in a semi-supervised learning setting. Our key insight is that pseudo-labels are naturally imbalanced due to intrinsic data similarity, even when a model is trained on balanced source data and evaluated on balanced target data. If we address this previously unknown imbalanced classification problem arising from pseudo-labels instead of ground-truth training labels, we could remove model biases towards false majorities created by pseudo-labels. We propose a novel and effective debiased learning method with pseudo-labels, based on counterfactual reasoning and adaptive margins: The former removes the classifier response bias, whereas the latter adjusts the margin of each class according to the imbalance of pseudo-labels. Validated by extensive experimentation, our simple debiased learning delivers significant accuracy gains over the state-of-the-art on ImageNet-1K: 26% for semi-supervised learning with 0.2% annotations and 9% for zero-shot learning. Our code is available at: https://github.com/frank-xwang/debiased-pseudo-labeling.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10, 40 LabelsPercentage error5.4DebiasPL (w/ FixMatch)
Image ClassificationCIFAR-10, 250 LabelsPercentage error4.6DebiasPL (w/ FixMatch)
Semi-Supervised Image ClassificationCIFAR-10, 40 LabelsPercentage error5.4DebiasPL (w/ FixMatch)
Semi-Supervised Image ClassificationCIFAR-10, 250 LabelsPercentage error4.6DebiasPL (w/ FixMatch)

Related Papers

GLAD: Generalizable Tuning for Vision-Language Models2025-07-17DEARLi: Decoupled Enhancement of Recognition and Localization for Semi-supervised Panoptic Segmentation2025-07-14ViT-ProtoNet for Few-Shot Image Classification: A Multi-Benchmark Evaluation2025-07-12Active Inference AI Systems for Scientific Discovery2025-06-26HalluSegBench: Counterfactual Visual Reasoning for Segmentation Hallucination Evaluation2025-06-26EVA: Mixture-of-Experts Semantic Variant Alignment for Compositional Zero-Shot Learning2025-06-26Zero-Shot Learning for Obsolescence Risk Forecasting2025-06-26SEZ-HARN: Self-Explainable Zero-shot Human Activity Recognition Network2025-06-25