Temporal Ensembling for Semi-Supervised Learning

Samuli Laine, Timo Aila

2016-10-07General Classification Semi-Supervised Image Classification

Paper PDF Code Code Code Code Code Code Code(official)

Abstract

In this paper, we present a simple and efficient method for training deep neural networks in a semi-supervised setting where only a small portion of training data is labeled. We introduce self-ensembling, where we form a consensus prediction of the unknown labels using the outputs of the network-in-training on different epochs, and most importantly, under different regularization and input augmentation conditions. This ensemble prediction can be expected to be a better predictor for the unknown labels than the output of the network at the most recent training epoch, and can thus be used as a target for training. Using our method, we set new records for two standard semi-supervised learning benchmarks, reducing the (non-augmented) classification error rate from 18.44% to 7.05% in SVHN with 500 labels and from 18.63% to 16.55% in CIFAR-10 with 4000 labels, and further to 5.12% and 12.16% by enabling the standard augmentations. We additionally obtain a clear improvement in CIFAR-100 classification accuracy by using random images from the Tiny Images dataset as unlabeled extra inputs during training. Finally, we demonstrate good tolerance to incorrect labels.

Results

Task	Dataset	Metric	Value	Model
Image Classification	CIFAR-10, 4000 Labels	Percentage error	12.16	Pi Model
Image Classification	cifar-100, 10000 Labels	Percentage error	38.65	Temporal ensembling
Image Classification	CIFAR-10, 250 Labels	Percentage error	53.12	Ⅱ-Model
Semi-Supervised Image Classification	CIFAR-10, 4000 Labels	Percentage error	12.16	Pi Model
Semi-Supervised Image Classification	cifar-100, 10000 Labels	Percentage error	38.65	Temporal ensembling
Semi-Supervised Image Classification	CIFAR-10, 250 Labels	Percentage error	53.12	Ⅱ-Model

Related Papers

ViTSGMM: A Robust Semi-Supervised Image Recognition Network Using Sparse Labels2025-06-04 Applications and Effect Evaluation of Generative Adversarial Networks in Semi-Supervised Learning2025-05-26 Simple Semi-supervised Knowledge Distillation from Vision-Language Models via $\mathbf{\texttt{D}}$ual-$\mathbf{\texttt{H}}$ead $\mathbf{\texttt{O}}$ptimization2025-05-12 Weakly Semi-supervised Whole Slide Image Classification by Two-level Cross Consistency Supervision2025-04-16 Specialized text classification: an approach to classifying Open Banking transactions2025-04-10 Diff-SySC: An Approach Using Diffusion Models for Semi-Supervised Image Classification2025-02-25 Universal Training of Neural Networks to Achieve Bayes Optimal Classification Accuracy2025-01-13 Revisiting MLLMs: An In-Depth Analysis of Image Classification Abilities2024-12-21