TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/AutoAugment: Learning Augmentation Policies from Data

AutoAugment: Learning Augmentation Policies from Data

Ekin D. Cubuk, Barret Zoph, Dandelion Mane, Vijay Vasudevan, Quoc V. Le

2018-05-24Image AugmentationImage ClassificationData AugmentationDomain GeneralizationFine-Grained Image Classification
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

Data augmentation is an effective technique for improving the accuracy of modern image classifiers. However, current data augmentation implementations are manually designed. In this paper, we describe a simple procedure called AutoAugment to automatically search for improved data augmentation policies. In our implementation, we have designed a search space where a policy consists of many sub-policies, one of which is randomly chosen for each image in each mini-batch. A sub-policy consists of two operations, each operation being an image processing function such as translation, rotation, or shearing, and the probabilities and magnitudes with which the functions are applied. We use a search algorithm to find the best policy such that the neural network yields the highest validation accuracy on a target dataset. Our method achieves state-of-the-art accuracy on CIFAR-10, CIFAR-100, SVHN, and ImageNet (without additional data). On ImageNet, we attain a Top-1 accuracy of 83.5% which is 0.4% better than the previous record of 83.1%. On CIFAR-10, we achieve an error rate of 1.5%, which is 0.6% better than the previous state-of-the-art. Augmentation policies we find are transferable between datasets. The policy learned on ImageNet transfers well to achieve significant improvements on other datasets, such as Oxford Flowers, Caltech-101, Oxford-IIT Pets, FGVC Aircraft, and Stanford Cars.

Results

TaskDatasetMetricValueModel
Domain AdaptationVizWiz-ClassificationAccuracy - All Images42.6EfficientNet-B3 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images47.5EfficientNet-B3 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images34.9EfficientNet-B3 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images41.6EfficientNet-B2 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images45.8EfficientNet-B2 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images34.3EfficientNet-B2 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images39.7EfficientNet-B1 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images44.4EfficientNet-B1 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images32.8EfficientNet-B1 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - All Images34.9EfficientNet-B0 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Clean Images40.1EfficientNet-B0 (autoaug)
Domain AdaptationVizWiz-ClassificationAccuracy - Corrupted Images27.3EfficientNet-B0 (autoaug)
Data AugmentationImageNetAccuracy (%)80ResNet-200 (AA)
Data AugmentationImageNetAccuracy (%)77.6ResNet-50 (AA)
Image ClassificationCIFAR-100Percentage correct89.3PyramidNet+ShakeDrop
Image ClassificationFGVC AircraftTop-1 Error Rate7.33AutoAugment
Fine-Grained Image ClassificationFGVC AircraftTop-1 Error Rate7.33AutoAugment
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images42.6EfficientNet-B3 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images47.5EfficientNet-B3 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images34.9EfficientNet-B3 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images41.6EfficientNet-B2 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images45.8EfficientNet-B2 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images34.3EfficientNet-B2 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images39.7EfficientNet-B1 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images44.4EfficientNet-B1 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images32.8EfficientNet-B1 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - All Images34.9EfficientNet-B0 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Clean Images40.1EfficientNet-B0 (autoaug)
Domain GeneralizationVizWiz-ClassificationAccuracy - Corrupted Images27.3EfficientNet-B0 (autoaug)

Related Papers

Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17Overview of the TalentCLEF 2025: Skill and Job Title Intelligence for Human Capital Management2025-07-17Pixel Perfect MegaMed: A Megapixel-Scale Vision-Language Foundation Model for Generating High Resolution Medical Images2025-07-17Simulate, Refocus and Ensemble: An Attention-Refocusing Scheme for Domain Generalization2025-07-17