Searching for A Robust Neural Architecture in Four GPU Hours

Xuanyi Dong, Yi Yang

2019-10-10CVPR 2019 6Reinforcement Learning Neural Architecture Search

Paper PDF Code Code Code Code Code Code(official)

Abstract

Conventional neural architecture search (NAS) approaches are based on reinforcement learning or evolutionary strategy, which take more than 3000 GPU hours to find a good model on CIFAR-10. We propose an efficient NAS approach learning to search by gradient descent. Our approach represents the search space as a directed acyclic graph (DAG). This DAG contains billions of sub-graphs, each of which indicates a kind of neural architecture. To avoid traversing all the possibilities of the sub-graphs, we develop a differentiable sampler over the DAG. This sampler is learnable and optimized by the validation loss after training the sampled architecture. In this way, our approach can be trained in an end-to-end fashion by gradient descent, named Gradient-based search using Differentiable Architecture Sampler (GDAS). In experiments, we can finish one searching procedure in four GPU hours on CIFAR-10, and the discovered model obtains a test error of 2.82\% with only 2.5M parameters, which is on par with the state-of-the-art. Code is publicly available on GitHub: https://github.com/D-X-Y/NAS-Projects.

Results

Task	Dataset	Metric	Value	Model
Neural Architecture Search	NAS-Bench-201, ImageNet-16-120	Accuracy (Test)	41.71	GDAS
Neural Architecture Search	NAS-Bench-201, ImageNet-16-120	Search time (s)	28926	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-10	Accuracy (Test)	93.61	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-10	Accuracy (Val)	89.89	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-10	Search time (s)	28926	GDAS
Neural Architecture Search	CIFAR-10	Search Time (GPU days)	0.17	GDAS (FRC)
Neural Architecture Search	CIFAR-10	Search Time (GPU days)	0.21	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-100	Accuracy (Test)	70.7	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-100	Accuracy (Val)	71.34	GDAS
Neural Architecture Search	NAS-Bench-201, CIFAR-100	Search time (s)	28926	GDAS
AutoML	NAS-Bench-201, ImageNet-16-120	Accuracy (Test)	41.71	GDAS
AutoML	NAS-Bench-201, ImageNet-16-120	Search time (s)	28926	GDAS
AutoML	NAS-Bench-201, CIFAR-10	Accuracy (Test)	93.61	GDAS
AutoML	NAS-Bench-201, CIFAR-10	Accuracy (Val)	89.89	GDAS
AutoML	NAS-Bench-201, CIFAR-10	Search time (s)	28926	GDAS
AutoML	CIFAR-10	Search Time (GPU days)	0.17	GDAS (FRC)
AutoML	CIFAR-10	Search Time (GPU days)	0.21	GDAS
AutoML	NAS-Bench-201, CIFAR-100	Accuracy (Test)	70.7	GDAS
AutoML	NAS-Bench-201, CIFAR-100	Accuracy (Val)	71.34	GDAS
AutoML	NAS-Bench-201, CIFAR-100	Search time (s)	28926	GDAS

Searching for A Robust Neural Architecture in Four GPU Hours

Abstract

Results

Related Papers

Searching for A Robust Neural Architecture in Four GPU Hours

Abstract

Results

Related Papers