Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


One-Shot Neural Architecture Search via Self-Evaluated Template Network

Xuanyi Dong, Yi Yang

2019-10-13 · ICCV 2019 · Neural Architecture Search

Abstract

Neural architecture search (NAS) aims to automate the architecture search procedure instead of relying on manual design. Although recent NAS approaches finish the search within days, lengthy training is still required for a specific architecture candidate to obtain the parameters needed for its accurate evaluation. Recently, one-shot NAS methods have been proposed to largely compress this tedious training process by sharing parameters across candidates. In this way, the parameters for each candidate can be directly extracted from the shared parameters instead of being trained from scratch. However, these methods have no sense of which candidate will perform better until evaluation, so the candidates to evaluate are randomly sampled and the top-1 candidate is considered the best. In this paper, we propose a Self-Evaluated Template Network (SETN) to improve the quality of the architecture candidates selected for evaluation, so that the evaluated set is more likely to cover competitive candidates. SETN consists of two components: (1) an evaluator, which learns to indicate the probability of each individual architecture having a lower validation loss; candidates for evaluation can thus be selectively sampled according to this evaluator. (2) A template network, which shares parameters among all candidates to amortize the training cost of generated candidates. In experiments, the architecture found by SETN achieves state-of-the-art performance on CIFAR and ImageNet benchmarks at comparable computation cost. Code is publicly available on GitHub: https://github.com/D-X-Y/AutoDL-Projects.
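The core idea of the evaluator component can be sketched in a few lines: instead of sampling candidate architectures uniformly at random, sample them in proportion to the evaluator's predicted probability of having low validation loss, then evaluate only the sampled candidates using weights taken from the shared template network. The sketch below is a minimal, illustrative toy in plain Python; the function names and the raw scores are hypothetical, not from the official implementation (which is in the linked GitHub repo).

```python
import math
import random

def softmax(scores):
    """Turn raw evaluator scores into a probability distribution."""
    m = max(scores)
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]

def select_candidates(evaluator_scores, k, rng):
    """Sample k distinct candidate indices, weighted by the evaluator's
    probability that each architecture has a lower validation loss."""
    probs = softmax(evaluator_scores)
    remaining = list(range(len(probs)))
    chosen = []
    for _ in range(min(k, len(remaining))):
        total = sum(probs[i] for i in remaining)
        r = rng.random() * total
        acc = 0.0
        for i in remaining:
            acc += probs[i]
            if acc >= r:
                chosen.append(i)
                remaining.remove(i)
                break
    return chosen

# Toy usage: hypothetical evaluator scores for 6 candidate architectures
# (higher score = evaluator believes lower validation loss).
rng = random.Random(0)
scores = [0.1, 2.0, -1.0, 0.5, 3.0, 0.0]
candidates = select_candidates(scores, k=3, rng=rng)
# Each sampled candidate would then be evaluated with parameters extracted
# from the shared template network, and the top-1 candidate kept.
```

In the paper's framing, this selective sampling is what distinguishes SETN from earlier one-shot methods, which draw evaluation candidates uniformly at random.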

Results

| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | Accuracy (Val) | 32.52 | SETN |
| Neural Architecture Search | NAS-Bench-201, CIFAR-10 | Search time (s) | 31010 | SETN |
| Neural Architecture Search | CIFAR-10 | Search time (GPU days) | 1.8 | SETN (T=1K) + CutOut |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | Accuracy (Test) | 56.87 | SETN |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | Accuracy (Val) | 59.05 | SETN |
| Neural Architecture Search | NAS-Bench-201, CIFAR-100 | Search time (s) | 31010 | SETN |
| AutoML | NAS-Bench-201, ImageNet-16-120 | Accuracy (Val) | 32.52 | SETN |
| AutoML | NAS-Bench-201, CIFAR-10 | Search time (s) | 31010 | SETN |
| AutoML | CIFAR-10 | Search time (GPU days) | 1.8 | SETN (T=1K) + CutOut |
| AutoML | NAS-Bench-201, CIFAR-100 | Accuracy (Test) | 56.87 | SETN |
| AutoML | NAS-Bench-201, CIFAR-100 | Accuracy (Val) | 59.05 | SETN |
| AutoML | NAS-Bench-201, CIFAR-100 | Search time (s) | 31010 | SETN |

Related Papers

- DASViT: Differentiable Architecture Search for Vision Transformer (2025-07-17)
- AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing (2025-06-23)
- From Tiny Machine Learning to Tiny Deep Learning: A Survey (2025-06-21)
- One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification (2025-06-17)
- DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification (2025-06-17)
- MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering (2025-06-16)
- Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks: An Architecture Optimization Approach (2025-06-16)
- Directed Acyclic Graph Convolutional Networks (2025-06-13)