Playing Atari with Deep Reinforcement Learning

Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, Martin Riedmiller

2013-12-19Multi-Goal Reinforcement Learning Reinforcement Learning Atari Games Q-Learning

Abstract

We present the first deep learning model to successfully learn control policies directly from high-dimensional sensory input using reinforcement learning. The model is a convolutional neural network, trained with a variant of Q-learning, whose input is raw pixels and whose output is a value function estimating future rewards. We apply our method to seven Atari 2600 games from the Arcade Learning Environment, with no adjustment of the architecture or learning algorithm. We find that it outperforms all previous approaches on six of the games and surpasses a human expert on three of them.

Results

Task	Dataset	Metric	Value	Model
Atari Games	Atari 2600 Pong	Score	21	DQN Best
Atari Games	Atari 2600 Enduro	Score	661	DQN Best
Atari Games	Atari 2600 Breakout	Score	225	DQN Best
Atari Games	Atari 2600 Space Invaders	Score	1075	DQN Best
Atari Games	Atari 2600 Beam Rider	Score	5184	DQN Best
Atari Games	Atari 2600 Seaquest	Score	1740	DQN Best
Atari Games	Atari 2600 Q*Bert	Score	4500	DQN Best
Video Games	Atari 2600 Pong	Score	21	DQN Best
Video Games	Atari 2600 Enduro	Score	661	DQN Best
Video Games	Atari 2600 Breakout	Score	225	DQN Best
Video Games	Atari 2600 Space Invaders	Score	1075	DQN Best
Video Games	Atari 2600 Beam Rider	Score	5184	DQN Best
Video Games	Atari 2600 Seaquest	Score	1740	DQN Best
Video Games	Atari 2600 Q*Bert	Score	4500	DQN Best

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18 VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17 Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17 Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17 VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17 QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17 Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17 Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17