TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Searching for A Robust Neural Architecture in Four GPU Hours

Searching for A Robust Neural Architecture in Four GPU Hours

Xuanyi Dong, Yi Yang

2019-10-10CVPR 2019 6Reinforcement LearningNeural Architecture Search
PaperPDFCodeCodeCodeCodeCodeCode(official)

Abstract

Conventional neural architecture search (NAS) approaches are based on reinforcement learning or evolutionary strategy, which take more than 3000 GPU hours to find a good model on CIFAR-10. We propose an efficient NAS approach learning to search by gradient descent. Our approach represents the search space as a directed acyclic graph (DAG). This DAG contains billions of sub-graphs, each of which indicates a kind of neural architecture. To avoid traversing all the possibilities of the sub-graphs, we develop a differentiable sampler over the DAG. This sampler is learnable and optimized by the validation loss after training the sampled architecture. In this way, our approach can be trained in an end-to-end fashion by gradient descent, named Gradient-based search using Differentiable Architecture Sampler (GDAS). In experiments, we can finish one searching procedure in four GPU hours on CIFAR-10, and the discovered model obtains a test error of 2.82\% with only 2.5M parameters, which is on par with the state-of-the-art. Code is publicly available on GitHub: https://github.com/D-X-Y/NAS-Projects.

Results

TaskDatasetMetricValueModel
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Accuracy (Test)41.71GDAS
Neural Architecture SearchNAS-Bench-201, ImageNet-16-120Search time (s)28926GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-10Accuracy (Test)93.61GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-10Accuracy (Val)89.89GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-10Search time (s)28926GDAS
Neural Architecture SearchCIFAR-10Search Time (GPU days)0.17GDAS (FRC)
Neural Architecture SearchCIFAR-10Search Time (GPU days)0.21GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-100Accuracy (Test)70.7GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-100Accuracy (Val)71.34GDAS
Neural Architecture SearchNAS-Bench-201, CIFAR-100Search time (s)28926GDAS
AutoMLNAS-Bench-201, ImageNet-16-120Accuracy (Test)41.71GDAS
AutoMLNAS-Bench-201, ImageNet-16-120Search time (s)28926GDAS
AutoMLNAS-Bench-201, CIFAR-10Accuracy (Test)93.61GDAS
AutoMLNAS-Bench-201, CIFAR-10Accuracy (Val)89.89GDAS
AutoMLNAS-Bench-201, CIFAR-10Search time (s)28926GDAS
AutoMLCIFAR-10Search Time (GPU days)0.17GDAS (FRC)
AutoMLCIFAR-10Search Time (GPU days)0.21GDAS
AutoMLNAS-Bench-201, CIFAR-100Accuracy (Test)70.7GDAS
AutoMLNAS-Bench-201, CIFAR-100Accuracy (Val)71.34GDAS
AutoMLNAS-Bench-201, CIFAR-100Search time (s)28926GDAS

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17