TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/MixMatch: A Holistic Approach to Semi-Supervised Learning

MixMatch: A Holistic Approach to Semi-Supervised Learning

David Berthelot, Nicholas Carlini, Ian Goodfellow, Nicolas Papernot, Avital Oliver, Colin Raffel

2019-05-06NeurIPS 2019 12Image ClassificationSemi-Supervised Image Classification
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCode

Abstract

Semi-supervised learning has proven to be a powerful paradigm for leveraging unlabeled data to mitigate the reliance on large labeled datasets. In this work, we unify the current dominant approaches for semi-supervised learning to produce a new algorithm, MixMatch, that works by guessing low-entropy labels for data-augmented unlabeled examples and mixing labeled and unlabeled data using MixUp. We show that MixMatch obtains state-of-the-art results by a large margin across many datasets and labeled data amounts. For example, on CIFAR-10 with 250 labels, we reduce error rate by a factor of 4 (from 38% to 11%) and by a factor of 2 on STL-10. We also demonstrate how MixMatch can help achieve a dramatically better accuracy-privacy trade-off for differential privacy. Finally, we perform an ablation study to tease apart which components of MixMatch are most important for its success.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10Percentage correct95.05MixMatch
Image ClassificationCIFAR-100Percentage correct74.1MixMatch
Image ClassificationSTL-10Percentage correct94.41MixMatch
Image ClassificationSTL-10Percentage correct89.82MixMatch
Image ClassificationSTL-10Percentage correct87.36CutOut
Image ClassificationSVHNPercentage error2.59MixMatch
Image ClassificationCIFAR-10, 4000 LabelsPercentage error6.24MixMatch
Image ClassificationCIFAR-10, 2000 LabelsAccuracy92.97MixMatch
Image ClassificationSTL-10, 1000 LabelsAccuracy89.82MixMatch
Image ClassificationSVHN, 500 LabelsAccuracy96.36MixMatch
Image ClassificationSVHN, 2000 LabelsAccuracy96.96MixMatch
Image ClassificationCIFAR-10, 1000 LabelsAccuracy92.25MixMatch
Image ClassificationCIFAR-10, 500 LabelsAccuracy91.35MixMatch
Image ClassificationSVHN, 4000 LabelsAccuracy97.11MixMatch
Image ClassificationSVHN, 1000 labelsAccuracy96.73MixMatch
Image ClassificationSTL-10, 5000 LabelsAccuracy94.41MixMatch
Image ClassificationSVHN, 250 LabelsAccuracy96.22MixMatch
Image ClassificationCIFAR-10, 250 LabelsPercentage error11.08MixMatch
Semi-Supervised Image ClassificationCIFAR-10, 4000 LabelsPercentage error6.24MixMatch
Semi-Supervised Image ClassificationCIFAR-10, 2000 LabelsAccuracy92.97MixMatch
Semi-Supervised Image ClassificationSTL-10, 1000 LabelsAccuracy89.82MixMatch
Semi-Supervised Image ClassificationSVHN, 500 LabelsAccuracy96.36MixMatch
Semi-Supervised Image ClassificationSVHN, 2000 LabelsAccuracy96.96MixMatch
Semi-Supervised Image ClassificationCIFAR-10, 1000 LabelsAccuracy92.25MixMatch
Semi-Supervised Image ClassificationCIFAR-10, 500 LabelsAccuracy91.35MixMatch
Semi-Supervised Image ClassificationSVHN, 4000 LabelsAccuracy97.11MixMatch
Semi-Supervised Image ClassificationSVHN, 1000 labelsAccuracy96.73MixMatch
Semi-Supervised Image ClassificationSTL-10, 5000 LabelsAccuracy94.41MixMatch
Semi-Supervised Image ClassificationSVHN, 250 LabelsAccuracy96.22MixMatch
Semi-Supervised Image ClassificationCIFAR-10, 250 LabelsPercentage error11.08MixMatch

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Hashed Watermark as a Filter: Defeating Forging and Overwriting Attacks in Weight-based Neural Network Watermarking2025-07-15Transferring Styles for Reduced Texture Bias and Improved Robustness in Semantic Segmentation Networks2025-07-14FedGSCA: Medical Federated Learning with Global Sample Selector and Client Adaptive Adjuster under Label Noise2025-07-13