Bayesian REX

Bayesian Reward Extrapolation

Reinforcement LearningIntroduced 20001 papers

Description

Bayesian Reward Extrapolation is a Bayesian reward learning algorithm that scales to high-dimensional imitation learning problems by pre-training a low-dimensional feature encoding via self-supervised tasks and then leveraging preferences over demonstrations to perform fast Bayesian inference.

Papers Using This Method

Safe Imitation Learning via Fast Bayesian Reward Inference from Preferences2020-02-21