TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/Fully Parameterized Quantile Function for Distributional R...

Fully Parameterized Quantile Function for Distributional Reinforcement Learning

Derek Yang, Li Zhao, Zichuan Lin, Tao Qin, Jiang Bian, Tie-Yan Liu

2019-11-05NeurIPS 2019 12Reinforcement LearningAtari Gamesreinforcement-learning
PaperPDFCodeCodeCodeCodeCodeCode

Abstract

Distributional Reinforcement Learning (RL) differs from traditional RL in that, rather than the expectation of total returns, it estimates distributions and has achieved state-of-the-art performance on Atari Games. The key challenge in practical distributional RL algorithms lies in how to parameterize estimated distributions so as to better approximate the true continuous distribution. Existing distributional RL algorithms parameterize either the probability side or the return value side of the distribution function, leaving the other side uniformly fixed as in C51, QR-DQN or randomly sampled as in IQN. In this paper, we propose fully parameterized quantile function that parameterizes both the quantile fraction axis (i.e., the x-axis) and the value axis (i.e., y-axis) for distributional RL. Our algorithm contains a fraction proposal network that generates a discrete set of quantile fractions and a quantile value network that gives corresponding quantile values. The two networks are jointly trained to find the best approximation of the true distribution. Experiments on 55 Atari Games show that our algorithm significantly outperforms existing distributional RL algorithms and creates a new record for the Atari Learning Environment for non-distributed agents.

Results

TaskDatasetMetricValueModel
Atari GamesAtari 2600 SkiingScore-9085.3FQF
Atari GamesAtari 2600 Ms. PacmanScore7631.9FQF
Atari GamesAtari 2600 BreakoutScore854.2FQF
Atari GamesAtari 2600 FrostbiteScore214060Fearlessmrx
Atari GamesAtari 2600 Space InvadersScore46498.3FQF
Atari GamesAtari 2600 James BondScore87291.7FQF
Atari GamesAtari 2600 AmidarScore3165.3FQF
Atari GamesAtari 2600 Crazy ClimberScore223470.6FQF
Atari GamesAtari 2600 AsteroidsScore4553FQF
Atari GamesAtari 2600 GravitarScore1406FQF
Atari GamesAtari 2600 Battle ZoneScore87928.6FQF
Atari GamesAtari 2600 PhoenixScore174077.5FQF
Atari GamesAtari 2600 AsterixScore578388.5FQF
Atari GamesAtari 2600 Kung-Fu MasterScore111138.5FQF
Atari GamesAtari 2600 BowlingScore102.3FQF
Atari GamesAtari 2600 AlienScore16754.6FQF
Atari GamesAtari 2600 Fishing DerbyScore52.7FQF
Atari GamesAtari 2600 Chopper CommandScore876460FQF
Atari GamesAtari 2600 Wizard of WorScore44782.6FQF
Atari GamesAtari 2600 RobotankScore75.7FQF
Atari GamesAtari 2600 Star GunnerScore131981.2FQF
Atari GamesAtari 2600 Ice HockeyScore17.3FQF
Atari GamesAtari 2600 BerzerkScore12422.2FQF
Atari GamesAtari 2600 HEROScore30926.2FQF
Atari GamesAtari 2600 River RaidScore23560.7FQF
Video GamesAtari 2600 SkiingScore-9085.3FQF
Video GamesAtari 2600 Ms. PacmanScore7631.9FQF
Video GamesAtari 2600 BreakoutScore854.2FQF
Video GamesAtari 2600 FrostbiteScore214060Fearlessmrx
Video GamesAtari 2600 Space InvadersScore46498.3FQF
Video GamesAtari 2600 James BondScore87291.7FQF
Video GamesAtari 2600 AmidarScore3165.3FQF
Video GamesAtari 2600 Crazy ClimberScore223470.6FQF
Video GamesAtari 2600 AsteroidsScore4553FQF
Video GamesAtari 2600 GravitarScore1406FQF
Video GamesAtari 2600 Battle ZoneScore87928.6FQF
Video GamesAtari 2600 PhoenixScore174077.5FQF
Video GamesAtari 2600 AsterixScore578388.5FQF
Video GamesAtari 2600 Kung-Fu MasterScore111138.5FQF
Video GamesAtari 2600 BowlingScore102.3FQF
Video GamesAtari 2600 AlienScore16754.6FQF
Video GamesAtari 2600 Fishing DerbyScore52.7FQF
Video GamesAtari 2600 Chopper CommandScore876460FQF
Video GamesAtari 2600 Wizard of WorScore44782.6FQF
Video GamesAtari 2600 RobotankScore75.7FQF
Video GamesAtari 2600 Star GunnerScore131981.2FQF
Video GamesAtari 2600 Ice HockeyScore17.3FQF
Video GamesAtari 2600 BerzerkScore12422.2FQF
Video GamesAtari 2600 HEROScore30926.2FQF
Video GamesAtari 2600 River RaidScore23560.7FQF

Related Papers

CUDA-L1: Improving CUDA Optimization via Contrastive Reinforcement Learning2025-07-18VisionThink: Smart and Efficient Vision Language Model via Reinforcement Learning2025-07-17Spectral Bellman Method: Unifying Representation and Exploration in RL2025-07-17Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17VAR-MATH: Probing True Mathematical Reasoning in Large Language Models via Symbolic Multi-Instance Benchmarks2025-07-17QuestA: Expanding Reasoning Capacity in LLMs via Question Augmentation2025-07-17Inverse Reinforcement Learning Meets Large Language Model Post-Training: Basics, Advances, and Opportunities2025-07-17Autonomous Resource Management in Microservice Systems via Reinforcement Learning2025-07-17