Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


β-DARTS: Beta-Decay Regularization for Differentiable Architecture Search

Peng Ye, Baopu Li, Yikang Li, Tao Chen, Jiayuan Fan, Wanli Ouyang

2022-03-03 · Neural Architecture Search
Paper · PDF · Code (official)

Abstract

Neural Architecture Search (NAS) has attracted increasing attention in recent years because of its ability to design deep neural networks automatically. Among NAS approaches, differentiable methods such as DARTS have gained popularity for their search efficiency. However, they suffer from two main issues: weak robustness to performance collapse and poor generalization of the searched architectures. To address these problems, a simple but effective regularization method, termed Beta-Decay, is proposed to regularize the DARTS-based search process. Specifically, Beta-Decay regularization imposes constraints that keep the value and variance of the activated architecture parameters from becoming too large. Furthermore, we provide an in-depth theoretical analysis of how and why it works. Experimental results on NAS-Bench-201 show that the proposed method stabilizes the search process and makes the searched networks more transferable across different datasets. In addition, our search scheme shows an outstanding property of being less dependent on training time and data. Comprehensive experiments on a variety of search spaces and datasets validate the effectiveness of the proposed method.
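The regularizer described in the abstract can be sketched as follows. This is a minimal illustration, not the paper's exact implementation: it assumes the Beta-Decay term is a log-sum-exp penalty over each edge's architecture parameters β, which grows when the β values are large or spread out, so minimizing it keeps the softmax-activated parameters small and low-variance. The weight `lam` and its schedule are hypothetical placeholders.

```python
import math

def softmax(xs):
    """Numerically stable softmax over one edge's architecture parameters."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def beta_decay_reg(edge_betas):
    """Illustrative Beta-Decay regularizer (assumed form): the
    log-sum-exp of the architecture parameters on each edge, summed
    over all edges. Large or highly spread-out beta values raise this
    term, so driving it down constrains both the magnitude and the
    variance of the softmax-activated parameters."""
    total = 0.0
    for betas in edge_betas:
        m = max(betas)  # shift for numerical stability
        total += m + math.log(sum(math.exp(b - m) for b in betas))
    return total

def search_loss(val_loss, edge_betas, lam=0.5):
    """Architecture-step objective: validation loss plus the weighted
    Beta-Decay term. `lam` is a hypothetical constant here; in practice
    the regularization weight would be scheduled over search epochs."""
    return val_loss + lam * beta_decay_reg(edge_betas)
```

For a uniform edge `[0, 0, 0]` the regularizer is ln 3 ≈ 1.10, while a large, peaked edge such as `[3, -3, -3]` pushes it above 3, so the penalty discourages the architecture parameters from drifting to extreme values during search.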

Results

Task | Dataset | Metric | Value | Model
Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | Accuracy (Test) | 46.71 | β-SDARTS-RS
Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | Accuracy (Test) | 46.71 | β-RDARTS-L2
Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | Accuracy (Test) | 46.34 | β-DARTS
Neural Architecture Search | NAS-Bench-201, ImageNet-16-120 | Accuracy (Val) | 46.37 | β-DARTS
Neural Architecture Search | NAS-Bench-201, CIFAR-10 | Accuracy (Test) | 94.36 | β-DARTS
Neural Architecture Search | NAS-Bench-201, CIFAR-10 | Accuracy (Val) | 91.55 | β-DARTS
Neural Architecture Search | NAS-Bench-201, CIFAR-100 | Accuracy (Test) | 73.51 | β-DARTS
Neural Architecture Search | NAS-Bench-201, CIFAR-100 | Accuracy (Val) | 73.49 | β-DARTS
Neural Architecture Search | CIFAR-100 | Percentage Error | 16.52 | β-DARTS
Neural Architecture Search | ImageNet | Top-1 Error Rate | 23.9 | β-DARTS (CIFAR-10)

The same results are also listed under the AutoML task.

Related Papers

DASViT: Differentiable Architecture Search for Vision Transformer (2025-07-17)
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing (2025-06-23)
From Tiny Machine Learning to Tiny Deep Learning: A Survey (2025-06-21)
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification (2025-06-17)
DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification (2025-06-17)
MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering (2025-06-16)
Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks: An Architecture Optimization Approach (2025-06-16)
Directed Acyclic Graph Convolutional Networks (2025-06-13)