Stochastic Dueling Network

Reinforcement LearningIntroduced 200012 papers

Description

A Stochastic Dueling Network, or SDN, is an architecture for learning a value function $V$ . The SDN learns both $V$ and $Q$ off-policy while maintaining consistency between the two estimates. At each time step it outputs a stochastic estimate of $Q$ and a deterministic estimate of $V$ .

Papers Using This Method

Dynamics of Resource Allocation in O-RANs: An In-depth Exploration of On-Policy and Off-Policy Deep Reinforcement Learning for Real-Time Applications2024-11-17 Joint Physical-Digital Facial Attack Detection Via Simulating Spoofing Clues2024-04-12 Distributional Estimation of Data Uncertainty for Surveillance Face Anti-spoofing2023-09-18 PDVN: A Patch-based Dual-view Network for Face Liveness Detection using Light Field Focal Stack2023-01-17 Asynchronous Curriculum Experience Replay: A Deep Reinforcement Learning Approach for UAV Autonomous Motion Control in Unknown Dynamic Environments2022-07-04 Is Word Error Rate a good evaluation metric for Speech Recognition in Indic Languages?2022-03-30 Learning Reward Machines: A Study in Partially Observable Reinforcement Learning2021-12-17 A-DeepPixBis: Attentional Angular Margin for Face Anti-Spoofing2021-03-01 Learning Reward Machines for Partially Observable Reinforcement Learning2019-12-01 Sample Efficient Deep Reinforcement Learning for Dialogue Systems with Large Action Spaces2018-02-11 Pretraining Deep Actor-Critic Reinforcement Learning Algorithms With Expert Demonstrations2018-01-31 Sample Efficient Actor-Critic with Experience Replay2016-11-03