N. Mazyavkina, S. Moustafa, I. Trofimov, E. Burnaev
Reinforcement learning (RL) has enjoyed significant progress in recent years. One of the most important steps forward has been the wide adoption of neural networks. However, the architectures of these networks are typically designed manually. In this work, we study recently proposed neural architecture search (NAS) methods for optimizing the architecture of RL agents. We carry out experiments on the Atari benchmark and conclude that modern NAS methods find architectures for RL agents that outperform a manually designed one.
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Atari Games | Atari 2600 Freeway | Score | 22 | ENAS |
| Atari Games | Atari 2600 Freeway | Score | 22 | SPOS |
| Atari Games | Atari 2600 Breakout | Score | 180.6 | SPOS |
| Atari Games | Atari 2600 Breakout | Score | 161.1 | ENAS Search space 1 |
| Atari Games | Atari 2600 Breakout | Score | 144.4 | SPOS Search space 1 |
| Atari Games | Atari 2600 Breakout | Score | 91.4 | ENAS |
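To give a flavor of the single-path one-shot (SPOS) approach evaluated above, the sketch below shows its core search loop: sample one operation per layer uniformly from a search space and keep the best-scoring candidate. This is a minimal illustration, not the paper's implementation; the search space, operation names, and the stand-in `evaluate` function (which in practice would train/run the RL agent on Atari) are all assumptions for the sake of a runnable example.

```python
import random

# Illustrative search space: candidate operations per layer of the agent's network.
SEARCH_SPACE = [
    ["conv3x3", "conv5x5", "maxpool"],   # layer 1 choices
    ["conv3x3", "conv5x5", "identity"],  # layer 2 choices
    ["fc256", "fc512"],                  # head choices
]

def sample_path(space, rng):
    """SPOS-style uniform sampling: pick one operation per layer."""
    return tuple(rng.choice(ops) for ops in space)

def evaluate(path):
    """Stand-in for running the RL agent with this architecture.

    In the real setting this would return an Atari score; here it is a
    toy deterministic function so the sketch is self-contained.
    """
    return sum(len(op) for op in path)

def search(space, n_samples=50, seed=0):
    """Random single-path search: sample architectures, keep the best."""
    rng = random.Random(seed)
    best_path, best_score = None, float("-inf")
    for _ in range(n_samples):
        path = sample_path(space, rng)
        score = evaluate(path)
        if score > best_score:
            best_path, best_score = path, score
    return best_path, best_score

best, score = search(SEARCH_SPACE)
print(best, score)
```

In the full SPOS method, all candidate paths share the weights of one supernet, so evaluating a sampled path is cheap; the loop above only captures the sampling-and-ranking step.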