EfficientNetV2: Smaller Models and Faster Training

Mingxing Tan, Quoc V. Le

2021-04-01Image Classification AutoML Data Augmentation Neural Architecture Search Classification

Paper PDF Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code Code(official)Code Code Code Code Code Code Code Code Code

Abstract

This paper introduces EfficientNetV2, a new family of convolutional networks that have faster training speed and better parameter efficiency than previous models. To develop this family of models, we use a combination of training-aware neural architecture search and scaling, to jointly optimize training speed and parameter efficiency. The models were searched from the search space enriched with new ops such as Fused-MBConv. Our experiments show that EfficientNetV2 models train much faster than state-of-the-art models while being up to 6.8x smaller. Our training can be further sped up by progressively increasing the image size during training, but it often causes a drop in accuracy. To compensate for this accuracy drop, we propose to adaptively adjust regularization (e.g., dropout and data augmentation) as well, such that we can achieve both fast training and good accuracy. With progressive learning, our EfficientNetV2 significantly outperforms previous models on ImageNet and CIFAR/Cars/Flowers datasets. By pretraining on the same ImageNet21k, our EfficientNetV2 achieves 87.3% top-1 accuracy on ImageNet ILSVRC2012, outperforming the recent ViT by 2.0% accuracy while training 5x-11x faster using the same computing resources. Code will be available at https://github.com/google/automl/tree/master/efficientnetv2.

Results

Task	Dataset	Metric	Value	Model
Image Classification	Stanford Cars	Accuracy	95.1	EfficientNetV2-L
Image Classification	Stanford Cars	Accuracy	94.6	EfficientNetV2-M
Image Classification	Stanford Cars	Accuracy	93.8	EfficientNetV2-S
Image Classification	CIFAR-10	Percentage correct	99.1	EfficientNetV2-L
Image Classification	CIFAR-10	Percentage correct	99	EfficientNetV2-M
Image Classification	CIFAR-10	Percentage correct	98.7	EfficientNetV2-S
Image Classification	Flowers-102	Accuracy	98.8	EfficientNetV2-L
Image Classification	Flowers-102	Accuracy	98.5	EfficientNetV2-M
Image Classification	Flowers-102	Accuracy	97.9	EfficientNetV2-S
Image Classification	CIFAR-100	Percentage correct	92.3	EfficientNetV2-L
Image Classification	CIFAR-100	Percentage correct	92.2	EfficientNetV2-M
Image Classification	CIFAR-100	Percentage correct	91.5	EfficientNetV2-S
Image Classification	ImageNet	GFLOPs	94	EfficientNetV2-XL (21k)
Image Classification	ImageNet	GFLOPs	53	EfficientNetV2-L (21k)
Image Classification	ImageNet	GFLOPs	24	EfficientNetV2-M (21k)
Image Classification	ImageNet	GFLOPs	53	EfficientNetV2-L
Image Classification	ImageNet	GFLOPs	8.8	EfficientNetV2-S (21k)

EfficientNetV2: Smaller Models and Faster Training

Abstract

Results

Related Papers

EfficientNetV2: Smaller Models and Faster Training

Abstract

Results

Related Papers