Tasks SotA Datasets Papers Methods Submit About

Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable Benchmarks All SotA Datasets Papers Methods

Community

Submit Results About

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

DQN

Deep Q-Network

Reinforcement LearningIntroduced 2000519 papers

Description

A DQN, or Deep Q-Network, approximates a state-value function in a Q-Learning framework with a neural network. In the Atari Games case, they take in several frames of the game as an input and output state values for each action as an output.

It is usually used in conjunction with Experience Replay, for storing the episode steps in memory for off-policy learning, where samples are drawn from the replay memory at random. Additionally, the Q-Network is usually optimized towards a frozen target network that is periodically updated with the latest weights every $k$ steps (where $k$ is a hyperparameter). The latter makes training more stable by preventing short-term oscillations from a moving target. The former tackles autocorrelation that would occur from on-line learning, and having a replay memory makes the problem more like a supervised learning problem.

Image Source: here

Papers Using This Method

Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound2025-07-15 Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study2025-07-08 2048: Reinforcement Learning in a Delayed Reward Environment2025-07-07 VRAIL: Vectorized Reward-based Attribution for Interpretable Learning2025-06-19 GCN-Driven Reinforcement Learning for Probabilistic Real-Time Guarantees in Industrial URLLC2025-06-17 Reliable Critics: Monotonic Improvement and Convergence Guarantees for Reinforcement Learning2025-06-08 Getting More from Less: Transfer Learning Improves Sleep Stage Decoding Accuracy in Peripheral Wearable Devices2025-05-31 Combining Deep Architectures for Information Gain estimation and Reinforcement Learning for multiagent field exploration2025-05-29 The Cell Must Go On: Agar.io for Continual Reinforcement Learning2025-05-23 LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models2025-05-21 Automatic Reward Shaping from Confounded Offline Data2025-05-16 Reinforcement Learning for Game-Theoretic Resource Allocation on Graphs2025-05-08 Interpretable Learning Dynamics in Unsupervised Reinforcement Learning2025-05-06 Universal Approximation Theorem of Deep Q-Networks2025-05-04 Approximation to Deep Q-Network by Stochastic Delay Differential Equations2025-05-01 State-Aware IoT Scheduling Using Deep Q-Networks and Edge-Based Coordination2025-04-22 AlphaGrad: Non-Linear Gradient Normalization Optimizer2025-04-22 Graph Based Deep Reinforcement Learning Aided by Transformers for Multi-Agent Cooperation2025-04-11 Dynamic Operating System Scheduling Using Double DQN: A Reinforcement Learning Approach to Task Optimization2025-03-31 Deep Q-Learning with Gradient Target Tracking2025-03-20