Description
D4PG, or Distributed Distributional DDPG, is a policy gradient algorithm that extends DDPG. Its improvements include a distributional critic update, combined with the use of multiple distributed workers all writing into the same replay table. Among the other, simpler changes, the largest performance gain came from the use of N-step returns. The authors found that prioritized experience replay was less crucial to the overall D4PG algorithm, especially on harder problems.
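As a rough illustration of the N-step return the description highlights, here is a minimal sketch of computing a bootstrapped N-step target. The function name and signature are illustrative only, not from the paper; in D4PG the bootstrap term would be the critic's (distributional) value estimate at the N-th successor state.

```python
def n_step_return(rewards, bootstrap_value, gamma=0.99):
    """Compute the N-step return:
        G = r_0 + gamma * r_1 + ... + gamma^(N-1) * r_{N-1} + gamma^N * V(s_N)

    rewards         -- list of the N rewards observed along the trajectory
    bootstrap_value -- value estimate V(s_N) at the state N steps ahead
    gamma           -- discount factor
    """
    g = bootstrap_value
    # Accumulate backwards so each reward picks up the right power of gamma.
    for r in reversed(rewards):
        g = r + gamma * g
    return g


# Example: three unit rewards, no bootstrap, gamma = 0.5
# G = 1 + 0.5 * 1 + 0.25 * 1 = 1.75
print(n_step_return([1.0, 1.0, 1.0], bootstrap_value=0.0, gamma=0.5))
```

With N = 1 this reduces to the standard one-step TD target used by vanilla DDPG; larger N trades off bias (from the bootstrapped value) against variance (from the longer reward sum).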
Papers Using This Method
- Learning in complex action spaces without policy gradients (2024-10-08)
- Mitigating Estimation Errors by Twin TD-Regularized Actor and Critic for Deep Reinforcement Learning (2023-11-07)
- SDGym: Low-Code Reinforcement Learning Environments using System Dynamics Models (2023-10-19)
- A Long $N$-step Surrogate Stage Reward for Deep Reinforcement Learning (2023-09-21)
- Gamma and Vega Hedging Using Deep Distributional Reinforcement Learning (2022-05-10)
- Revisiting Gaussian mixture critics in off-policy reinforcement learning: a sample-based approach (2022-04-21)
- Tonic: A Deep Reinforcement Learning Library for Fast Prototyping and Benchmarking (2020-11-15)
- Distributed Uplink Beamforming in Cell-Free Networks Using Deep Reinforcement Learning (2020-06-26)
- Sample-based Distributional Policy Gradient (2020-01-08)
- TF-Replicator: Distributed Machine Learning for Researchers (2019-02-01)
- Distributed Distributional Deterministic Policy Gradients (2018-04-23)