TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/ReMixMatch: Semi-Supervised Learning with Distribution Ali...

ReMixMatch: Semi-Supervised Learning with Distribution Alignment and Augmentation Anchoring

David Berthelot, Nicholas Carlini, Ekin D. Cubuk, Alex Kurakin, Kihyuk Sohn, Han Zhang, Colin Raffel

2019-11-21Image ClassificationSemi-Supervised Image Classification
PaperPDFCode(official)Code(official)Code

Abstract

We improve the recently-proposed "MixMatch" semi-supervised learning algorithm by introducing two new techniques: distribution alignment and augmentation anchoring. Distribution alignment encourages the marginal distribution of predictions on unlabeled data to be close to the marginal distribution of ground-truth labels. Augmentation anchoring feeds multiple strongly augmented versions of an input into the model and encourages each output to be close to the prediction for a weakly-augmented version of the same input. To produce strong augmentations, we propose a variant of AutoAugment which learns the augmentation policy while the model is being trained. Our new algorithm, dubbed ReMixMatch, is significantly more data-efficient than prior work, requiring between $5\times$ and $16\times$ less data to reach the same accuracy. For example, on CIFAR-10 with 250 labeled examples we reach $93.73\%$ accuracy (compared to MixMatch's accuracy of $93.58\%$ with $4{,}000$ examples) and a median accuracy of $84.92\%$ with just four labels per class. We make our code and data open-source at https://github.com/google-research/remixmatch.

Results

TaskDatasetMetricValueModel
Image ClassificationSTL-10Percentage correct93.82ReMixMatch (K=4)
Image ClassificationSTL-10Percentage correct93.23ReMixMatch (K=1)
Image ClassificationSTL-10Percentage correct77.8CC-GAN
Image ClassificationCIFAR-10, 4000 LabelsPercentage error5.14ReMixMatch
Image ClassificationSTL-10, 1000 LabelsAccuracy93.82ReMixMatch
Image ClassificationSVHN, 1000 labelsAccuracy97.17ReMixMatch
Image Classificationcifar10, 250 LabelsPercentage correct93.73ReMixMatch
Image ClassificationCIFAR-10, 40 LabelsPercentage error19.1ReMixMatch
Image ClassificationCIFAR-10, 250 LabelsPercentage error6.27ReMixMatch
Semi-Supervised Image ClassificationCIFAR-10, 4000 LabelsPercentage error5.14ReMixMatch
Semi-Supervised Image ClassificationSTL-10, 1000 LabelsAccuracy93.82ReMixMatch
Semi-Supervised Image ClassificationSVHN, 1000 labelsAccuracy97.17ReMixMatch
Semi-Supervised Image Classificationcifar10, 250 LabelsPercentage correct93.73ReMixMatch
Semi-Supervised Image ClassificationCIFAR-10, 40 LabelsPercentage error19.1ReMixMatch
Semi-Supervised Image ClassificationCIFAR-10, 250 LabelsPercentage error6.27ReMixMatch

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13