Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Methods/MADDPG

MADDPG

Reinforcement LearningIntroduced 200036 papers

Description

MADDPG, or Multi-agent DDPG, extends DDPG into a multi-agent policy gradient algorithm where decentralized agents learn a centralized critic based on the observations and actions of all agents. It leads to learned policies that only use local information (i.e. their own observations) at execution time, does not assume a differentiable model of the environment dynamics or any particular structure on the communication method between agents, and is applicable not only to cooperative interaction but to competitive or mixed interaction involving both physical and communicative behavior. The critic is augmented with extra information about the policies of other agents, while the actor only has access to local information. After training is completed, only the local actors are used at execution phase, acting in a decentralized manner.

Papers Using This Method

Fully-Decentralized MADDPG with Networked Agents2025-03-09 Cooperative Multi-Agent Deep Reinforcement Learning in Content Ranking Optimization2024-08-08 An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning2024-05-10 Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing2024-02-18 Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains2023-12-09 Adaptive Resource Management for Edge Network Slicing using Incremental Multi-Agent Deep Reinforcement Learning2023-10-26 Safe Hierarchical Reinforcement Learning for CubeSat Task Scheduling Based on Energy Consumption2023-09-21 Progression Cognition Reinforcement Learning with Prioritized Experience for Multi-Vehicle Pursuit2023-06-08 Reinforcement Learning With Reward Machines in Stochastic Games2023-05-27 Revisiting the Gumbel-Softmax in MADDPG2023-02-23 On Multi-Agent Deep Deterministic Policy Gradients and their Explainability for SMARTS Environment2023-01-20 Multiagent Reinforcement Learning Based on Fusion-Multiactor-Attention-Critic for Multiple-Unmanned-Aerial-Vehicle Navigation Control2022-10-10 A New Approach to Training Multiple Cooperative Agents for Autonomous Driving2022-09-05 Two-Hop Age of Information Scheduling for Multi-UAV Assisted Mobile Edge Computing: FRL vs MADDPG2022-06-19 Balancing Profit, Risk, and Sustainability for Portfolio Management2022-06-06 MA-Dreamer: Coordination and communication through shared imagination2022-04-10 Decision-making of Emergent Incident based on P-MADDPG2022-03-19 Learning to Infer Belief Embedded Communication2022-03-15 Decentralized Multi-Agent Reinforcement Learning: An Off-Policy Method2021-10-31 Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning2021-09-23