TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/SimMatch: Semi-supervised Learning with Similarity Matching

SimMatch: Semi-supervised Learning with Similarity Matching

Mingkai Zheng, Shan You, Lang Huang, Fei Wang, Chen Qian, Chang Xu

2022-03-14CVPR 2022 1Semantic SimilaritySemantic Textual SimilaritySemi-Supervised Image Classification
PaperPDFCode(official)

Abstract

Learning with few labeled data has been a longstanding problem in the computer vision and machine learning research community. In this paper, we introduced a new semi-supervised learning framework, SimMatch, which simultaneously considers semantic similarity and instance similarity. In SimMatch, the consistency regularization will be applied on both semantic-level and instance-level. The different augmented views of the same instance are encouraged to have the same class prediction and similar similarity relationship respected to other instances. Next, we instantiated a labeled memory buffer to fully leverage the ground truth labels on instance-level and bridge the gaps between the semantic and instance similarities. Finally, we proposed the \textit{unfolding} and \textit{aggregation} operation which allows these two similarities be isomorphically transformed with each other. In this way, the semantic and instance pseudo-labels can be mutually propagated to generate more high-quality and reliable matching targets. Extensive experimental results demonstrate that SimMatch improves the performance of semi-supervised learning tasks across different benchmark datasets and different settings. Notably, with 400 epochs of training, SimMatch achieves 67.2\%, and 74.4\% Top-1 Accuracy with 1\% and 10\% labeled examples on ImageNet, which significantly outperforms the baseline methods and is better than previous semi-supervised learning frameworks. Code and pre-trained models are available at https://github.com/KyleZheng1997/simmatch.

Results

TaskDatasetMetricValueModel
Image ClassificationCIFAR-10, 4000 LabelsPercentage error3.96SimMatch
Image ClassificationCIFAR-100, 2500 LabelsPercentage error25.07SimMatch
Image Classificationcifar-100, 10000 LabelsPercentage error20.58SimMatch
Image ClassificationCIFAR-100, 400 LabelsPercentage error37.81SimMatch
Image ClassificationCIFAR-10, 40 LabelsPercentage error5.6SimMatch
Image ClassificationCIFAR-10, 250 LabelsPercentage error4.84SimMatch
Semi-Supervised Image ClassificationCIFAR-10, 4000 LabelsPercentage error3.96SimMatch
Semi-Supervised Image ClassificationCIFAR-100, 2500 LabelsPercentage error25.07SimMatch
Semi-Supervised Image Classificationcifar-100, 10000 LabelsPercentage error20.58SimMatch
Semi-Supervised Image ClassificationCIFAR-100, 400 LabelsPercentage error37.81SimMatch
Semi-Supervised Image ClassificationCIFAR-10, 40 LabelsPercentage error5.6SimMatch
Semi-Supervised Image ClassificationCIFAR-10, 250 LabelsPercentage error4.84SimMatch

Related Papers

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts2025-07-17SARA: Selective and Adaptive Retrieval-augmented Generation with Context Compression2025-07-08FA: Forced Prompt Learning of Vision-Language Models for Out-of-Distribution Detection2025-07-06LineRetriever: Planning-Aware Observation Reduction for Web Agents2025-06-30Enhancing Automatic Term Extraction with Large Language Models via Syntactic Retrieval2025-06-26DALR: Dual-level Alignment Learning for Multimodal Sentence Representation Learning2025-06-26Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation2025-06-25Leveraging Vision-Language Models to Select Trustworthy Super-Resolution Samples Generated by Diffusion Models2025-06-25