SimMatch: Semi-supervised Learning with Similarity Matching

Mingkai Zheng, Shan You, Lang Huang, Fei Wang, Chen Qian, Chang Xu

2022-03-14CVPR 2022 1Semantic Similarity Semantic Textual Similarity Semi-Supervised Image Classification

Abstract

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers semantic similarity and instance similarity. In SimMatch, the consistency regularization will be applied on both semantic-level and instance-level. The different augmented views of the same instance are encouraged to have the same class prediction and similar similarity relationship respected to other instances. Next, we instantiated a labeled memory buffer to fully leverage the ground truth labels on instance-level and bridge the gaps between the semantic and instance similarities. Finally, we proposed the \textit{unfolding} and \textit{aggregation} operation which allows these two similarities be isomorphically transformed with each other. In this way, the semantic and instance pseudo-labels can be mutually propagated to generate more high-quality and reliable matching targets. Extensive experimental results demonstrate that SimMatch improves the performance of semi-supervised learning tasks across different benchmark datasets and different settings. Notably, with 400 epochs of training, SimMatch achieves 67.2\%, and 74.4\% Top-1 Accuracy with 1\% and 10\% labeled examples on ImageNet, which significantly outperforms the baseline methods and is better than previous semi-supervised learning frameworks. Code and pre-trained models are available at https://github.com/KyleZheng1997/simmatch.

Results

Task	Dataset	Metric	Value	Model
Image Classification	CIFAR-10, 4000 Labels	Percentage error	3.96	SimMatch
Image Classification	CIFAR-100, 2500 Labels	Percentage error	25.07	SimMatch
Image Classification	cifar-100, 10000 Labels	Percentage error	20.58	SimMatch
Image Classification	CIFAR-100, 400 Labels	Percentage error	37.81	SimMatch
Image Classification	CIFAR-10, 40 Labels	Percentage error	5.6	SimMatch
Image Classification	CIFAR-10, 250 Labels	Percentage error	4.84	SimMatch
Semi-Supervised Image Classification	CIFAR-10, 4000 Labels	Percentage error	3.96	SimMatch
Semi-Supervised Image Classification	CIFAR-100, 2500 Labels	Percentage error	25.07	SimMatch
Semi-Supervised Image Classification	cifar-100, 10000 Labels	Percentage error	20.58	SimMatch
Semi-Supervised Image Classification	CIFAR-100, 400 Labels	Percentage error	37.81	SimMatch
Semi-Supervised Image Classification	CIFAR-10, 40 Labels	Percentage error	5.6	SimMatch
Semi-Supervised Image Classification	CIFAR-10, 250 Labels	Percentage error	4.84	SimMatch

SimMatch: Semi-supervised Learning with Similarity Matching

Abstract

Results

Related Papers

SimMatch: Semi-supervised Learning with Similarity Matching

Abstract

Results

Related Papers