OCD: Learning to Overfit with Conditional Diffusion Models

Shahar Lutati, Lior Wolf

2022-10-02Denoising Image Classification Speech Separation 3D Reconstruction Few-Shot Text Classification

Abstract

We present a dynamic model in which the weights are conditioned on an input sample x and are learned to match those that would be obtained by finetuning a base model on x and its label y. This mapping between an input sample and network weights is approximated by a denoising diffusion model. The diffusion model we employ focuses on modifying a single layer of the base model and is conditioned on the input, activations, and output of this layer. Since the diffusion model is stochastic in nature, multiple initializations generate different networks, forming an ensemble, which leads to further improvements. Our experiments demonstrate the wide applicability of the method for image classification, 3D reconstruction, tabular data, speech separation, and natural language processing. Our code is available at https://github.com/ShaharLutatiPersonal/OCD

Results

Task	Dataset	Metric	Value	Model
Speech Separation	Libri5Mix	SI-SDRi	13.4	OCD
Text Classification	SST-5	Accuracy	0.478	SetFit + OCD
Text Classification	Average on NLP datasets	Accuracy	0.648	SetFit + OCD(5)
Text Classification	Average on NLP datasets	Accuracy	0.643	SetFit + OCD
Text Classification	Average on NLP datasets	Accuracy	0.633	T-few 3B
Text Classification	Average on NLP datasets	Accuracy	0.622	SetFit
Text Classification	Amazon Counterfeit	Accuracy	0.41	SetFit + OCD
Few-Shot Text Classification	SST-5	Accuracy	0.478	SetFit + OCD
Few-Shot Text Classification	Average on NLP datasets	Accuracy	0.648	SetFit + OCD(5)
Few-Shot Text Classification	Average on NLP datasets	Accuracy	0.643	SetFit + OCD
Few-Shot Text Classification	Average on NLP datasets	Accuracy	0.633	T-few 3B
Few-Shot Text Classification	Average on NLP datasets	Accuracy	0.622	SetFit
Few-Shot Text Classification	Amazon Counterfeit	Accuracy	0.41	SetFit + OCD
Classification	SST-5	Accuracy	0.478	SetFit + OCD
Classification	Average on NLP datasets	Accuracy	0.648	SetFit + OCD(5)
Classification	Average on NLP datasets	Accuracy	0.643	SetFit + OCD
Classification	Average on NLP datasets	Accuracy	0.633	T-few 3B
Classification	Average on NLP datasets	Accuracy	0.622	SetFit
Classification	Amazon Counterfeit	Accuracy	0.41	SetFit + OCD

OCD: Learning to Overfit with Conditional Diffusion Models

Abstract

Results

Related Papers

OCD: Learning to Overfit with Conditional Diffusion Models

Abstract

Results

Related Papers