TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/DARTS: Differentiable Architecture Search

DARTS: Differentiable Architecture Search

Hanxiao Liu, Karen Simonyan, Yiming Yang

2018-06-24ICLR 2019 5Image ClassificationReinforcement LearningNeural Architecture SearchGeneral ClassificationLanguage Modellingreinforcement-learning
PaperPDFCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode(official)CodeCodeCodeCodeCodeCodeCodeCodeCodeCodeCode

Abstract

This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner. Unlike conventional approaches of applying evolution or reinforcement learning over a discrete and non-differentiable search space, our method is based on the continuous relaxation of the architecture representation, allowing efficient search of the architecture using gradient descent. Extensive experiments on CIFAR-10, ImageNet, Penn Treebank and WikiText-2 show that our algorithm excels in discovering high-performance convolutional architectures for image classification and recurrent architectures for language modeling, while being orders of magnitude faster than state-of-the-art non-differentiable techniques. Our implementation has been made publicly available to facilitate further research on efficient architecture search algorithms.

Results

TaskDatasetMetricValueModel
Language ModellingPenn Treebank (Word Level)Test perplexity56.1Differentiable NAS
Language ModellingPenn Treebank (Word Level)Validation perplexity58.3Differentiable NAS
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Accuracy (Test)16.43DARTS (first order)
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Search time (s)10890DARTS (first order)
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Accuracy (Test)16.43DARTS (second order)
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Search time (s)29902DARTS (second order)
Neural Architecture SearchCIFAR-10 Image ClassificationPercentage error2.83DARTS + c/o
Neural Architecture SearchCIFAR-10 Image ClassificationSearch Time (GPU days)4DARTS + c/o
Neural Architecture SearchCIFAR-10Parameters3.3DARTS (second order)
Neural Architecture SearchCIFAR-10Search Time (GPU days)4DARTS (second order)
Neural Architecture SearchCIFAR-10Parameters3.3DARTS (first order)
Neural Architecture SearchCIFAR-10Search Time (GPU days)1.5DARTS (first order)
Neural Architecture SearchImageNetAccuracy73.3DARTS
Neural Architecture SearchImageNetParams4.9DARTS
Neural Architecture SearchImageNetTop-1 Error Rate26.7DARTS
AutoMLNAS-Bench-201, ImageNet-16-120Accuracy (Test)16.43DARTS (first order)
AutoMLNAS-Bench-201, ImageNet-16-120Search time (s)10890DARTS (first order)
AutoMLNAS-Bench-201, ImageNet-16-120Accuracy (Test)16.43DARTS (second order)
AutoMLNAS-Bench-201, ImageNet-16-120Search time (s)29902DARTS (second order)
AutoMLCIFAR-10 Image ClassificationPercentage error2.83DARTS + c/o
AutoMLCIFAR-10 Image ClassificationSearch Time (GPU days)4DARTS + c/o
AutoMLCIFAR-10Parameters3.3DARTS (second order)
AutoMLCIFAR-10Search Time (GPU days)4DARTS (second order)
AutoMLCIFAR-10Parameters3.3DARTS (first order)
AutoMLCIFAR-10Search Time (GPU days)1.5DARTS (first order)
AutoMLImageNetAccuracy73.3DARTS
AutoMLImageNetParams4.9DARTS
AutoMLImageNetTop-1 Error Rate26.7DARTS

Related Papers

Visual-Language Model Knowledge Distillation Method for Image Quality Assessment2025-07-21Automatic Classification and Segmentation of Tunnel Cracks Based on Deep Learning and Visual Explanations2025-07-18CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18Adversarial attacks to image classification systems using evolutionary algorithms2025-07-17Efficient Adaptation of Pre-trained Vision Transformer underpinned by Approximately Orthogonal Fine-Tuning Strategy2025-07-17Federated Learning for Commercial Image Sources2025-07-17MUPAX: Multidimensional Problem Agnostic eXplainable AI2025-07-17VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17