TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Count-Based Exploration with the Successor Representation

Count-Based Exploration with the Successor Representation

Marlos C. Machado, Marc G. Bellemare, Michael Bowling

2018-07-31ICLR 2019 5Reinforcement LearningAtari GamesEfficient Exploration
PaperPDFCode(official)Code

Abstract

In this paper we introduce a simple approach for exploration in reinforcement learning (RL) that allows us to develop theoretically justified algorithms in the tabular case but that is also extendable to settings where function approximation is required. Our approach is based on the successor representation (SR), which was originally introduced as a representation defining state generalization by the similarity of successor states. Here we show that the norm of the SR, while it is being learned, can be used as a reward bonus to incentivize exploration. In order to better understand this transient behavior of the norm of the SR we introduce the substochastic successor representation (SSR) and we show that it implicitly counts the number of times each state (or feature) has been observed. We use this result to introduce an algorithm that performs as well as some theoretically sample-efficient approaches. Finally, we extend these ideas to a deep RL algorithm and show that it achieves state-of-the-art performance in Atari 2600 games when in a low sample-complexity regime.

Results

TaskDatasetMetricValueModel
Atari GamesAtari 2600 FreewayScore29.5DQNMMCe
Atari GamesAtari 2600 Montezuma's RevengeScore1778.8DQN+SR
Atari GamesAtari 2600 Montezuma's RevengeScore1778.6DQNMMCe+SR
Atari GamesAtari 2600 GravitarScore1078.3DQNMMCe
Atari GamesAtari 2600 SolarisScore2244.6DQNMMCe
Atari GamesAtari 2600 VentureScore1241.8DQNMMCe+SR
Atari GamesAtari 2600 Private EyeScore99.1DQNMMCe+SR
Video GamesAtari 2600 FreewayScore29.5DQNMMCe
Video GamesAtari 2600 Montezuma's RevengeScore1778.8DQN+SR
Video GamesAtari 2600 Montezuma's RevengeScore1778.6DQNMMCe+SR
Video GamesAtari 2600 GravitarScore1078.3DQNMMCe
Video GamesAtari 2600 SolarisScore2244.6DQNMMCe
Video GamesAtari 2600 VentureScore1241.8DQNMMCe+SR
Video GamesAtari 2600 Private EyeScore99.1DQNMMCe+SR

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17