
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.


Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

DDPG

Deep Deterministic Policy Gradient

Reinforcement Learning · Introduced 2015 · 218 papers

Description

DDPG, or Deep Deterministic Policy Gradient, is a model-free, off-policy actor-critic algorithm based on the deterministic policy gradient that can operate over continuous action spaces. It combines the actor-critic approach with two insights from Deep Q-Networks (DQN): 1) the network is trained off-policy with samples from a replay buffer to minimize correlations between samples, and 2) the network is trained with a target Q network to give consistent targets during temporal difference backups. DDPG uses both ideas and adds batch normalization.
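The two DQN-derived ingredients named in the description can be sketched in a few lines of plain Python. This is a minimal illustration, not a reference implementation: the names `ReplayBuffer`, `td_targets`, and `soft_update` are hypothetical, the actor and critic are passed in as ordinary callables rather than neural networks, and the soft (Polyak) target update with a small `tau` is an assumption about how the target networks are kept slowly moving, since the description only says that a target Q network is used. Batch normalization and the actor's gradient step are left out.

```python
import random
from collections import deque


class ReplayBuffer:
    """Fixed-size FIFO store of (state, action, reward, next_state, done) tuples."""

    def __init__(self, capacity):
        # A deque with maxlen silently drops the oldest transition when full.
        self.storage = deque(maxlen=capacity)

    def add(self, state, action, reward, next_state, done):
        self.storage.append((state, action, reward, next_state, done))

    def sample(self, batch_size, rng=random):
        # Uniform sampling without replacement breaks up the temporal
        # correlation between consecutive transitions.
        return rng.sample(list(self.storage), batch_size)

    def __len__(self):
        return len(self.storage)


def td_targets(batch, target_actor, target_critic, gamma=0.99):
    """Compute y = r + gamma * Q'(s', mu'(s')) for each transition.

    target_actor(s) -> a and target_critic(s, a) -> q are the slowly
    updated copies of the actor and critic. Terminal transitions get
    no bootstrap term.
    """
    targets = []
    for _state, _action, reward, next_state, done in batch:
        if done:
            bootstrap = 0.0
        else:
            bootstrap = target_critic(next_state, target_actor(next_state))
        targets.append(reward + gamma * bootstrap)
    return targets


def soft_update(target_params, online_params, tau=0.001):
    """Move each target parameter a fraction tau toward its online value."""
    return [(1.0 - tau) * t + tau * o
            for t, o in zip(target_params, online_params)]


if __name__ == "__main__":
    buffer = ReplayBuffer(capacity=1000)
    for step in range(10):
        # Toy scalar states and actions, purely for illustration.
        buffer.add(float(step), 0.1 * step, 1.0, float(step + 1), step == 9)

    batch = buffer.sample(4)
    ys = td_targets(batch,
                    target_actor=lambda s: 0.5 * s,
                    target_critic=lambda s, a: s + a)
    print(ys)
    print(soft_update([0.0, 1.0], [1.0, 1.0], tau=0.1))
```

In a full agent, the critic would be regressed toward these targets on each sampled batch, and `soft_update` would be applied to both the target actor and the target critic after every gradient step.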

Papers Using This Method

Multi-Objective Reinforcement Learning for Cognitive Radar Resource Management (2025-06-25)
Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning (2025-06-08)
A Novel Deep Reinforcement Learning Method for Computation Offloading in Multi-User Mobile Edge Computing with Decentralization (2025-06-03)
LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models (2025-05-21)
Deep reinforcement learning-based longitudinal control strategy for automated vehicles at signalised intersections (2025-05-13)
Moderate Actor-Critic Methods: Controlling Overestimation Bias via Expectile Loss (2025-04-14)
Intelligent Joint Security and Delay Determinacy Performance Guarantee Strategy in RIS-Assisted IIoT Communication Systems (2025-03-11)
EvoRL: A GPU-accelerated Framework for Evolutionary Reinforcement Learning (2025-01-25)
Federated Deep Reinforcement Learning for Energy Efficient Multi-Functional RIS-Assisted Low-Earth Orbit Networks (2025-01-19)
Dynamic Portfolio Optimization via Augmented DDPG with Quantum Price Levels-Based Trading Strategy (2025-01-15)
AutoLoop: Fast Visual SLAM Fine-tuning through Agentic Curriculum Learning (2025-01-15)
An Advantage-based Optimization Method for Reinforcement Learning in Large Action Space (2024-12-17)
Broad Critic Deep Actor Reinforcement Learning for Continuous Control (2024-11-24)
Provably Efficient Action-Manipulation Attack Against Continuous Reinforcement Learning (2024-11-20)
Quantum Policy Gradient in Reproducing Kernel Hilbert Space (2024-11-11)
Mitigating Suboptimality of Deterministic Policy Gradients in Complex Q-functions (2024-10-15)
ETGL-DDPG: A Deep Deterministic Policy Gradient Algorithm for Sparse Reward Continuous Control (2024-10-07)
Robust Deep Reinforcement Learning for Volt-VAR Optimization in Active Distribution System under Uncertainty (2024-09-27)
FH-DRL: Exponential-Hyperbolic Frontier Heuristics with DRL for accelerated Exploration in Unknown Environments (2024-07-26)
The Cross-environment Hyperparameter Setting Benchmark for Reinforcement Learning (2024-07-26)