
Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.


Value Prediction Network

Junhyuk Oh, Satinder Singh, Honglak Lee

2017-07-11 | NeurIPS 2017 12 | Reinforcement Learning | Atari Games | Prediction

Abstract

This paper proposes a novel deep reinforcement learning (RL) architecture, called Value Prediction Network (VPN), which integrates model-free and model-based RL methods into a single neural network. In contrast to typical model-based RL methods, VPN learns a dynamics model whose abstract states are trained to make option-conditional predictions of future values (discounted sum of rewards) rather than of future observations. Our experimental results show that VPN has several advantages over both model-free and model-based baselines in a stochastic environment where careful planning is required but building an accurate observation-prediction model is difficult. Furthermore, VPN outperforms Deep Q-Network (DQN) on several Atari games even with short-lookahead planning, demonstrating its potential as a new way of learning a good state representation.
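The short-lookahead planning mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the `AbstractModel` container, the function names, and the toy integer "abstract states" are all hypothetical, and the depth-weighted mixing of the direct value estimate with the planned backup is an assumption of this sketch, so consult the paper for the exact planning rule. The point is only that planning happens over predicted rewards, discounts, and values of abstract states, and that no observation is ever predicted.

```python
from dataclasses import dataclass
from typing import Callable, Sequence, Tuple


@dataclass
class AbstractModel:
    """Hypothetical stand-ins for the learned modules (illustrative names only)."""
    value: Callable[[int], float]                       # V(s) for abstract state s
    outcome: Callable[[int, int], Tuple[float, float]]  # (reward, discount) for (s, o)
    transition: Callable[[int, int], int]               # next abstract state for (s, o)
    options: Sequence[int]                              # available options


def q_plan(m: AbstractModel, s: int, o: int, depth: int) -> float:
    """Estimate Q(s, o) by rolling the model `depth` steps ahead."""
    reward, discount = m.outcome(s, o)
    return reward + discount * v_plan(m, m.transition(s, o), depth)


def v_plan(m: AbstractModel, s: int, depth: int) -> float:
    """Value of s: the direct estimate at depth 1, otherwise mixed with a backup."""
    if depth <= 1:
        return m.value(s)
    best = max(q_plan(m, s, o, depth - 1) for o in m.options)
    # Assumed mixing rule: weight the planned backup more as depth grows.
    return m.value(s) / depth + (depth - 1) / depth * best


def best_option(m: AbstractModel, s: int, depth: int) -> int:
    """Greedy option under the depth-limited plan."""
    return max(m.options, key=lambda o: q_plan(m, s, o, depth))


# Toy model: states are integers, options move left or right, and reaching
# state 3 yields reward 1. The value estimate is deliberately uninformative.
toy = AbstractModel(
    value=lambda s: 0.0,
    outcome=lambda s, o: (1.0 if s + o == 3 else 0.0, 0.9),
    transition=lambda s, o: s + o,
    options=(-1, 1),
)

print(best_option(toy, 1, 2))
```

In this toy setup, a depth-1 plan from state 1 scores both options at 0 because the reward is two steps away, whereas a depth-2 plan assigns the rightward option a positive estimate. That is the sense in which even a short lookahead can improve on the direct value estimate.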

Results

Task | Dataset | Metric | Value | Model
Atari Games | Atari 2600 Ms. Pacman | Score | 2689 | VPN
Atari Games | Atari 2600 Enduro | Score | 382 | VPN
Atari Games | Atari 2600 Krull | Score | 15930 | VPN
Atari Games | Atari 2600 Frostbite | Score | 3811 | VPN
Atari Games | Atari 2600 Amidar | Score | 641 | VPN
Atari Games | Atari 2600 Crazy Climber | Score | 54119 | VPN
Atari Games | Atari 2600 Alien | Score | 1429 | VPN
Atari Games | Atari 2600 Seaquest | Score | 5628 | VPN
Atari Games | Atari 2600 Q*Bert | Score | 14517 | VPN
Video Games | Atari 2600 Ms. Pacman | Score | 2689 | VPN
Video Games | Atari 2600 Enduro | Score | 382 | VPN
Video Games | Atari 2600 Krull | Score | 15930 | VPN
Video Games | Atari 2600 Frostbite | Score | 3811 | VPN
Video Games | Atari 2600 Amidar | Score | 641 | VPN
Video Games | Atari 2600 Crazy Climber | Score | 54119 | VPN
Video Games | Atari 2600 Alien | Score | 1429 | VPN
Video Games | Atari 2600 Seaquest | Score | 5628 | VPN
Video Games | Atari 2600 Q*Bert | Score | 14517 | VPN
