MSR-DARTS: Minimum Stable Rank of Differentiable Architecture Search

Kengo Machida, Kuniaki Uto, Koichi Shinoda, Taiji Suzuki

2020-09-19 · Neural Architecture Search

Abstract

In neural architecture search (NAS), differentiable architecture search (DARTS) has recently attracted much attention due to its high efficiency. It defines an over-parameterized network with mixed edges, each of which represents all operator candidates, and jointly optimizes the network weights and the architecture in an alternating manner. However, this method tends to select the model whose weights converge fastest, and such a model often overfits, so the resulting model does not always generalize well. To overcome this problem, we propose minimum stable rank DARTS (MSR-DARTS), which aims to find a model with the best generalization error by replacing architecture optimization with a selection process based on the minimum stable rank criterion. Specifically, each convolution operator is represented by a matrix, and MSR-DARTS selects the operator with the smallest stable rank. We evaluated MSR-DARTS on the CIFAR-10 and ImageNet datasets. It achieves an error rate of 2.54% with 4.0M parameters within 0.3 GPU-days on CIFAR-10, and a top-1 error rate of 23.9% on ImageNet. The official code is available at https://github.com/mtaecchhi/msrdarts.git.
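The selection rule in the abstract can be illustrated with a small sketch. It assumes the standard definition of stable rank, the squared Frobenius norm divided by the squared spectral norm, and it takes each candidate operator as an already-built 2-D matrix; how MSR-DARTS turns a convolution into that matrix is not described here. The function names and the candidate labels are hypothetical and are not taken from the official code. The spectral norm is estimated with power iteration only to keep the example dependency-free.

```python
import math
import random


def frobenius_norm_sq(matrix):
    """Squared Frobenius norm: the sum of all squared entries."""
    return sum(x * x for row in matrix for x in row)


def spectral_norm_sq(matrix, iters=200, seed=0):
    """Estimate the squared largest singular value by power iteration on M^T M."""
    rng = random.Random(seed)
    n_rows, n_cols = len(matrix), len(matrix[0])
    v = [rng.gauss(0.0, 1.0) for _ in range(n_cols)]
    for _ in range(iters):
        # u = M v, then w = M^T u, then renormalize v.
        u = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
        w = [sum(matrix[i][j] * u[i] for i in range(n_rows)) for j in range(n_cols)]
        norm = math.sqrt(sum(x * x for x in w))
        if norm == 0.0:
            return 0.0  # treated as a zero matrix
        v = [x / norm for x in w]
    # With v of unit length, ||M v||^2 is the Rayleigh quotient of M^T M.
    u = [sum(row[j] * v[j] for j in range(n_cols)) for row in matrix]
    return sum(x * x for x in u)


def stable_rank(matrix):
    """Stable rank = ||M||_F^2 / ||M||_2^2 (assumed standard definition)."""
    spec = spectral_norm_sq(matrix)
    if spec == 0.0:
        raise ValueError("stable rank is undefined for a zero matrix")
    return frobenius_norm_sq(matrix) / spec


def select_min_stable_rank(candidates):
    """Return the name of the candidate matrix with the smallest stable rank."""
    return min(candidates, key=lambda name: stable_rank(candidates[name]))


# Hypothetical candidates on one edge, given directly as matrices.
candidates = {
    "op_identity_like": [[1.0, 0.0], [0.0, 1.0]],
    "op_rank_one": [[1.0, 1.0], [1.0, 1.0]],
}
print(select_min_stable_rank(candidates))
```

The stable rank never exceeds the ordinary rank and changes smoothly with the weights, which makes it usable as a comparison score between trained operators. In practice a linear algebra library would normally compute the two norms.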

Results

Task	Dataset	Metric	Value	Model
Neural Architecture Search	CIFAR-10	Search Time (GPU days)	0.3	MSR-DARTS
Neural Architecture Search	ImageNet	Accuracy (%)	76.1	MSR-DARTS (CIFAR-10)
Neural Architecture Search	ImageNet	Top-1 Error Rate (%)	23.9	MSR-DARTS (CIFAR-10)
AutoML	CIFAR-10	Search Time (GPU days)	0.3	MSR-DARTS
AutoML	ImageNet	Accuracy (%)	76.1	MSR-DARTS (CIFAR-10)
AutoML	ImageNet	Top-1 Error Rate (%)	23.9	MSR-DARTS (CIFAR-10)

Related Papers

DASViT: Differentiable Architecture Search for Vision Transformer (2025-07-17)
AnalogNAS-Bench: A NAS Benchmark for Analog In-Memory Computing (2025-06-23)
From Tiny Machine Learning to Tiny Deep Learning: A Survey (2025-06-21)
One-Shot Neural Architecture Search with Network Similarity Directed Initialization for Pathological Image Classification (2025-06-17)
DDS-NAS: Dynamic Data Selection within Neural Architecture Search via On-line Hard Example Mining applied to Image Classification (2025-06-17)
MARCO: Hardware-Aware Neural Architecture Search for Edge Devices with Multi-Agent Reinforcement Learning and Conformal Prediction Filtering (2025-06-16)
Finding Optimal Kernel Size and Dimension in Convolutional Neural Networks An Architecture Optimization Approach (2025-06-16)
Directed Acyclic Graph Convolutional Networks (2025-06-13)