Papers With Code

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Unsupervised Data Augmentation for Consistency Training

Qizhe Xie, Zihang Dai, Eduard Hovy, Minh-Thang Luong, Quoc V. Le

2019-04-29 · NeurIPS 2020
Tasks: Text Classification, Image Augmentation, Image Classification, Sentiment Analysis, Data Augmentation, Transfer Learning, Semi-Supervised Image Classification
Paper · PDF · Code (official)

Abstract

Semi-supervised learning lately has shown much promise in improving deep learning models when labeled data is scarce. Common among recent approaches is the use of consistency training on a large amount of unlabeled data to constrain model predictions to be invariant to input noise. In this work, we present a new perspective on how to effectively noise unlabeled examples and argue that the quality of noising, specifically those produced by advanced data augmentation methods, plays a crucial role in semi-supervised learning. By substituting simple noising operations with advanced data augmentation methods such as RandAugment and back-translation, our method brings substantial improvements across six language and three vision tasks under the same consistency training framework. On the IMDb text classification dataset, with only 20 labeled examples, our method achieves an error rate of 4.20, outperforming the state-of-the-art model trained on 25,000 labeled examples. On a standard semi-supervised learning benchmark, CIFAR-10, our method outperforms all previous approaches and achieves an error rate of 5.43 with only 250 examples. Our method also combines well with transfer learning, e.g., when finetuning from BERT, and yields improvements in high-data regime, such as ImageNet, whether when there is only 10% labeled data or when a full labeled set with 1.3M extra unlabeled examples is used. Code is available at https://github.com/google-research/uda.
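The training objective described in the abstract can be sketched as a supervised cross-entropy term plus an unsupervised consistency term that penalizes divergence between predictions on an unlabeled example and its augmented version. The following is a minimal NumPy sketch of that combined loss; the function names, the weighting parameter `lam`, and the use of plain logit arrays (rather than a real model and RandAugment/back-translation pipeline) are illustrative assumptions, not the paper's actual implementation.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the last axis.
    z = logits - logits.max(axis=-1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)

def uda_loss(sup_logits, sup_labels, unsup_logits, unsup_aug_logits, lam=1.0):
    """Sketch of the UDA objective: supervised cross-entropy plus a
    consistency term, KL(p(y|x) || p(y|augment(x))), on unlabeled data.
    In the real training loop the clean prediction p(y|x) is treated as a
    fixed target (no gradient flows through it); here we just compute the
    scalar loss value."""
    # Supervised cross-entropy on the labeled batch.
    probs = softmax(sup_logits)
    n = sup_labels.shape[0]
    ce = -np.mean(np.log(probs[np.arange(n), sup_labels] + 1e-12))
    # Consistency term between clean and augmented unlabeled predictions.
    p = softmax(unsup_logits)        # predictions on clean unlabeled x
    q = softmax(unsup_aug_logits)    # predictions on augment(x)
    kl = np.mean(np.sum(p * (np.log(p + 1e-12) - np.log(q + 1e-12)), axis=-1))
    return ce + lam * kl
```

When the augmented predictions match the clean ones, the consistency term vanishes and the loss reduces to the supervised cross-entropy; stronger augmentations (RandAugment for images, back-translation for text) make this term more informative, which is the paper's central claim.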

Results

Task                                | Dataset                          | Metric           | Value | Model
Sentiment Analysis                  | Amazon Review Polarity           | Accuracy         | 97.37 | BERT large
Sentiment Analysis                  | Amazon Review Polarity           | Accuracy         | 96.5  | BERT large finetune UDA
Sentiment Analysis                  | Yelp Fine-grained classification | Error            | 29.32 | BERT large
Sentiment Analysis                  | Yelp Fine-grained classification | Error            | 32.08 | BERT large finetune UDA
Sentiment Analysis                  | Yelp Binary classification       | Error            | 1.89  | BERT large
Sentiment Analysis                  | Yelp Binary classification       | Error            | 2.05  | BERT large finetune UDA
Sentiment Analysis                  | IMDb                             | Accuracy         | 95.8  | BERT large finetune UDA
Sentiment Analysis                  | IMDb                             | Accuracy         | 95.49 | BERT large
Sentiment Analysis                  | Amazon Review Full               | Accuracy         | 65.83 | BERT large
Sentiment Analysis                  | Amazon Review Full               | Accuracy         | 62.88 | BERT large finetune UDA
Text Classification                 | DBpedia                          | Error            | 0.68  | BERT large
Text Classification                 | DBpedia                          | Error            | 1.09  | BERT large UDA
Text Classification                 | Amazon-5                         | Error            | 37.12 | BERT Finetune + UDA
Text Classification                 | Amazon-2                         | Error            | 3.5   | BERT Finetune + UDA
Image Classification                | CIFAR-10, 4000 Labels            | Percentage error | 5.27  | UDA
Image Classification                | ImageNet, 10% labeled data       | Top-5 Accuracy   | 88.52 | UDA
Image Classification                | SVHN, 1000 labels                | Accuracy         | 97.54 | UDA
Classification                      | DBpedia                          | Error            | 0.68  | BERT large
Classification                      | DBpedia                          | Error            | 1.09  | BERT large UDA
Classification                      | Amazon-5                         | Error            | 37.12 | BERT Finetune + UDA
Classification                      | Amazon-2                         | Error            | 3.5   | BERT Finetune + UDA
Semi-Supervised Image Classification | CIFAR-10, 4000 Labels           | Percentage error | 5.27  | UDA
Semi-Supervised Image Classification | ImageNet, 10% labeled data      | Top-5 Accuracy   | 88.52 | UDA
Semi-Supervised Image Classification | SVHN, 1000 labels               | Accuracy         | 97.54 | UDA

Related Papers

- Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations (2025-07-18)
- RaMen: Multi-Strategy Multi-Modal Learning for Bundle Construction (2025-07-18)
- Making Language Model a Hierarchical Classifier and Generator (2025-07-17)
- Adversarial attacks to image classification systems using evolutionary algorithms (2025-07-17)
- Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy (2025-07-17)
- Federated Learning for Commercial Image Sources (2025-07-17)
- MUPAX: Multidimensional Problem Agnostic eXplainable AI (2025-07-17)
- AdaptiSent: Context-Aware Adaptive Attention for Multimodal Aspect-Based Sentiment Analysis (2025-07-17)