GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Raphael Chekroun, Marin Toromanoff, Sascha Hornauer, Fabien Moutarde

2021-11-16MuJoCo Continuous Control Decision Making Autonomous Driving CARLA MAP Leaderboard

Abstract

Deep reinforcement learning (DRL) has been demonstrated to be effective for several complex decision-making applications such as autonomous driving and robotics. However, DRL is notoriously limited by its high sample complexity and its lack of stability. Prior knowledge, e.g. as expert demonstrations, is often available but challenging to leverage to mitigate these issues. In this paper, we propose General Reinforced Imitation (GRI), a novel method which combines benefits from exploration and expert data and is straightforward to implement over any off-policy RL algorithm. We make one simplifying hypothesis: expert demonstrations can be seen as perfect data whose underlying policy gets a constant high reward. Based on this assumption, GRI introduces the notion of offline demonstration agents. This agent sends expert data which are processed both concurrently and indistinguishably with the experiences coming from the online RL exploration agent. We show that our approach enables major improvements on vision-based autonomous driving in urban environments. We further validate the GRI method on Mujoco continuous control tasks with different off-policy RL algorithms. Our method ranked first on the CARLA Leaderboard and outperforms World on Rails, the previous state-of-the-art, by 17%.

Results

Task	Dataset	Metric	Value	Model
Autonomous Vehicles	CARLA Leaderboard	Driving Score	36.79	GRIAD
Autonomous Vehicles	CARLA Leaderboard	Infraction penalty	0.6	GRIAD
Autonomous Vehicles	CARLA Leaderboard	Route Completion	61.85	GRIAD
Autonomous Vehicles	CARLA	Driving score	33.785	GRI-based DRL
Autonomous Vehicles	CARLA	Infraction penalty	0.568	GRI-based DRL
Autonomous Vehicles	CARLA	Route completion	57.442	GRI-based DRL
Autonomous Driving	CARLA Leaderboard	Driving Score	36.79	GRIAD
Autonomous Driving	CARLA Leaderboard	Infraction penalty	0.6	GRIAD
Autonomous Driving	CARLA Leaderboard	Route Completion	61.85	GRIAD
Autonomous Driving	CARLA	Driving score	33.785	GRI-based DRL
Autonomous Driving	CARLA	Infraction penalty	0.568	GRI-based DRL
Autonomous Driving	CARLA	Route completion	57.442	GRI-based DRL

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Abstract

Results

Related Papers

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Abstract

Results

Related Papers