TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/GRI: General Reinforced Imitation and its Application to V...

GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving

Raphael Chekroun, Marin Toromanoff, Sascha Hornauer, Fabien Moutarde

2021-11-16MuJoCoContinuous ControlDecision MakingAutonomous DrivingCARLA MAP Leaderboard
PaperPDF

Abstract

Deep reinforcement learning (DRL) has been demonstrated to be effective for several complex decision-making applications such as autonomous driving and robotics. However, DRL is notoriously limited by its high sample complexity and its lack of stability. Prior knowledge, e.g. as expert demonstrations, is often available but challenging to leverage to mitigate these issues. In this paper, we propose General Reinforced Imitation (GRI), a novel method which combines benefits from exploration and expert data and is straightforward to implement over any off-policy RL algorithm. We make one simplifying hypothesis: expert demonstrations can be seen as perfect data whose underlying policy gets a constant high reward. Based on this assumption, GRI introduces the notion of offline demonstration agents. This agent sends expert data which are processed both concurrently and indistinguishably with the experiences coming from the online RL exploration agent. We show that our approach enables major improvements on vision-based autonomous driving in urban environments. We further validate the GRI method on Mujoco continuous control tasks with different off-policy RL algorithms. Our method ranked first on the CARLA Leaderboard and outperforms World on Rails, the previous state-of-the-art, by 17%.

Results

TaskDatasetMetricValueModel
Autonomous VehiclesCARLA LeaderboardDriving Score36.79GRIAD
Autonomous VehiclesCARLA LeaderboardInfraction penalty0.6GRIAD
Autonomous VehiclesCARLA LeaderboardRoute Completion61.85GRIAD
Autonomous VehiclesCARLADriving score33.785GRI-based DRL
Autonomous VehiclesCARLAInfraction penalty0.568GRI-based DRL
Autonomous VehiclesCARLARoute completion57.442GRI-based DRL
Autonomous DrivingCARLA LeaderboardDriving Score36.79GRIAD
Autonomous DrivingCARLA LeaderboardInfraction penalty0.6GRIAD
Autonomous DrivingCARLA LeaderboardRoute Completion61.85GRIAD
Autonomous DrivingCARLADriving score33.785GRI-based DRL
Autonomous DrivingCARLAInfraction penalty0.568GRI-based DRL
Autonomous DrivingCARLARoute completion57.442GRI-based DRL

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19Graph-Structured Data Analysis of Component Failure in Autonomous Cargo Ships Based on Feature Fusion2025-07-18AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback2025-07-17Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)2025-07-17Higher-Order Pattern Unification Modulo Similarity Relations2025-07-17Exploiting Constraint Reasoning to Build Graphical Explanations for Mixed-Integer Linear Programming2025-07-17World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17