Biswadeep Chakraborty, Saibal Mukhopadhyay
We present a Model Uncertainty-aware Differentiable ARchiTecture Search ($\mu$DARTS) that optimizes neural networks to simultaneously achieve high accuracy and low uncertainty. We introduce concrete dropout within DARTS cells and include a Monte-Carlo regularizer within the training loss to optimize the concrete dropout probabilities. A predictive variance term is introduced in the validation loss to enable searching for architectures with minimal model uncertainty. Experiments on CIFAR10, CIFAR100, SVHN, and ImageNet verify the effectiveness of $\mu$DARTS in improving accuracy and reducing uncertainty compared to existing DARTS methods. Moreover, the final architecture obtained from $\mu$DARTS shows higher robustness to noise in the input image and in the model parameters than the architectures obtained from existing DARTS methods.
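The key ingredient described above, concrete dropout, replaces the discrete Bernoulli drop decision with a differentiable relaxation so the drop probability $p$ can be optimized by gradient descent alongside the architecture parameters. The following is a minimal NumPy sketch of that relaxation (following Gal et al.'s concrete dropout formulation); the function names, the temperature value, and the rescaling convention are illustrative assumptions, not the paper's implementation.

```python
import numpy as np

def concrete_dropout_mask(shape, p, temperature=0.1, rng=None):
    """Differentiable (relaxed) dropout mask: a soft keep-mask in (0, 1)
    sampled from the concrete relaxation of Bernoulli(drop prob = p).
    Because the mask is a smooth function of p, gradients can flow to p."""
    rng = np.random.default_rng() if rng is None else rng
    eps = 1e-7
    u = rng.uniform(eps, 1.0 - eps, size=shape)  # uniform noise
    # Concrete relaxation of the Bernoulli "drop" indicator
    logit = (np.log(p + eps) - np.log(1.0 - p + eps)
             + np.log(u) - np.log(1.0 - u))
    z = 1.0 / (1.0 + np.exp(-logit / temperature))  # soft drop indicator
    return 1.0 - z  # soft keep-mask

def apply_concrete_dropout(x, p, temperature=0.1, rng=None):
    """Apply a concrete-dropout mask and rescale to preserve E[x]."""
    mask = concrete_dropout_mask(x.shape, p, temperature, rng)
    return x * mask / (1.0 - p)
```

At evaluation time, model uncertainty is then estimated by running several stochastic forward passes with dropout active and taking the variance of the predictions; it is this predictive variance that the validation loss penalizes during the search.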
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Neural Architecture Search | CIFAR-100 | Percentage Error | 19.39 | μDARTS |
| Neural Architecture Search | CIFAR-100 | Search Time (GPU days) | 1.57 | μDARTS |
| Neural Architecture Search | CIFAR-10 | Search Time (GPU days) | 0.1 | μDARTS |
| Neural Architecture Search | ImageNet | Accuracy | 78.76 | μDARTS |
| Neural Architecture Search | ImageNet | Top-1 Error Rate | 21.24 | μDARTS |