ACTKR

Reinforcement LearningIntroduced 20002 papers

Description

ACKTR, or Actor Critic with Kronecker-factored Trust Region, is an actor-critic method for reinforcement learning that applies trust region optimization using a recently proposed Kronecker-factored approximation to the curvature. The method extends the framework of natural policy gradient and optimizes both the actor and the critic using Kronecker-factored approximate curvature (K-FAC) with trust region.

Papers Using This Method

myGym: Modular Toolkit for Visuomotor Robotic Tasks2020-12-21 Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation2017-08-17