Geometry-Aware Gradient Algorithms for Neural Architecture Search

Liam Li, Mikhail Khodak, Maria-Florina Balcan, Ameet Talwalkar

2020-04-16ICLR 2021 1Neural Architecture Search

Abstract

Recent state-of-the-art methods for neural architecture search (NAS) exploit gradient-based optimization by relaxing the problem into continuous optimization over architectures and shared-weights, a noisy process that remains poorly understood. We argue for the study of single-level empirical risk minimization to understand NAS with weight-sharing, reducing the design of NAS methods to devising optimizers and regularizers that can quickly obtain high-quality solutions to this problem. Invoking the theory of mirror descent, we present a geometry-aware framework that exploits the underlying structure of this optimization to return sparse architectural parameters, leading to simple yet novel algorithms that enjoy fast convergence guarantees and achieve state-of-the-art accuracy on the latest NAS benchmarks in computer vision. Notably, we exceed the best published results for both CIFAR and ImageNet on both the DARTS search space and NAS-Bench201; on the latter we achieve near-oracle-optimal performance on CIFAR-10 and CIFAR-100. Together, our theory and experiments demonstrate a principled way to co-design optimizers and continuous relaxations of discrete NAS search spaces.

Results

Task	Dataset	Metric	Value	Model
Neural Architecture Search	NAS-Bench-201, ImageNet-16-120	Accuracy (Test)	46.36	GAEA DARTS (ERM)
Neural Architecture Search	NAS-Bench-201, CIFAR-10	Accuracy (Test)	94.1	GAEA DARTS (ERM)
Neural Architecture Search	ImageNet	Params	5.6	GAEA PC-DARTS
Neural Architecture Search	ImageNet	Top-1 Error Rate	24	GAEA PC-DARTS
Neural Architecture Search	NAS-Bench-201, CIFAR-100	Accuracy (Test)	73.43	GAEA DARTS (ERM)
AutoML	NAS-Bench-201, ImageNet-16-120	Accuracy (Test)	46.36	GAEA DARTS (ERM)
AutoML	NAS-Bench-201, CIFAR-10	Accuracy (Test)	94.1	GAEA DARTS (ERM)
AutoML	ImageNet	Params	5.6	GAEA PC-DARTS
AutoML	ImageNet	Top-1 Error Rate	24	GAEA PC-DARTS
AutoML	NAS-Bench-201, CIFAR-100	Accuracy (Test)	73.43	GAEA DARTS (ERM)

Geometry-Aware Gradient Algorithms for Neural Architecture Search

Abstract

Results

Related Papers

Geometry-Aware Gradient Algorithms for Neural Architecture Search

Abstract

Results

Related Papers