TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DARTS-PRIME: Regularization and Scheduling Improve Constra...

DARTS-PRIME: Regularization and Scheduling Improve Constrained Optimization in Differentiable NAS

Kaitlin Maile, Erwan Lecarpentier, Hervé Luga, Dennis G. Wilson

2021-06-22Neural Architecture SearchScheduling
PaperPDF

Abstract

Differentiable Architecture Search (DARTS) is a recent neural architecture search (NAS) method based on a differentiable relaxation. Due to its success, numerous variants analyzing and improving parts of the DARTS framework have recently been proposed. By considering the problem as a constrained bilevel optimization, we present and analyze DARTS-PRIME, a variant including improvements to architectural weight update scheduling and regularization towards discretization. We propose a dynamic schedule based on per-minibatch network information to make architecture updates more informed, as well as proximity regularization to promote well-separated discretization. Our results in multiple domains show that DARTS-PRIME improves both performance and reliability, comparable to state-of-the-art in differentiable NAS.

Results

TaskDatasetMetricValueModel
Neural Architecture SearchCIFAR-100Percentage Error17.44DARTS-PRIME
Neural Architecture SearchCIFAR-10Search Time (GPU days)0.5DARTS-PRIME
AutoMLCIFAR-100Percentage Error17.44DARTS-PRIME
AutoMLCIFAR-10Search Time (GPU days)0.5DARTS-PRIME

Related Papers

An End-to-End DNN Inference Framework for the SpiNNaker2 Neuromorphic MPSoC2025-07-18DASViT: Differentiable Architecture Search for Vision Transformer2025-07-17Fremer: Lightweight and Effective Frequency Transformer for Workload Forecasting in Cloud Services2025-07-17Transient-Stability-Aware Frequency Provision in IBR-Rich Grids via Information Gap Decision Theory and Deep Learning2025-07-17Leveraging Asynchronous Cross-border Market Data for Improved Day-Ahead Electricity Price Forecasting in European Markets2025-07-17FormulaOne: Measuring the Depth of Algorithmic Reasoning Beyond Competitive Programming2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17SystolicAttention: Fusing FlashAttention within a Single Systolic Array2025-07-15