TasksSotADatasetsPapersMethodsSubmitAbout
Papers With Code 2

A community resource for machine learning research: papers, code, benchmarks, and state-of-the-art results.

Explore

Notable BenchmarksAll SotADatasetsPapersMethods

Community

Submit ResultsAbout

Data sourced from the PWC Archive (CC-BY-SA 4.0). Built by the community, for the community.

Papers/TransFuser: Imitation with Transformer-Based Sensor Fusion...

TransFuser: Imitation with Transformer-Based Sensor Fusion for Autonomous Driving

Kashyap Chitta, Aditya Prakash, Bernhard Jaeger, Zehao Yu, Katrin Renz, Andreas Geiger

2022-05-31Sensor FusionImitation LearningCARLA longest6Motion ForecastingAutonomous DrivingObject Detection
PaperPDFCode(official)CodeCode

Abstract

How should we integrate representations from complementary sensors for autonomous driving? Geometry-based fusion has shown promise for perception (e.g. object detection, motion forecasting). However, in the context of end-to-end driving, we find that imitation learning based on existing sensor fusion methods underperforms in complex driving scenarios with a high density of dynamic agents. Therefore, we propose TransFuser, a mechanism to integrate image and LiDAR representations using self-attention. Our approach uses transformer modules at multiple resolutions to fuse perspective view and bird's eye view feature maps. We experimentally validate its efficacy on a challenging new benchmark with long routes and dense traffic, as well as the official leaderboard of the CARLA urban driving simulator. At the time of submission, TransFuser outperforms all prior work on the CARLA leaderboard in terms of driving score by a large margin. Compared to geometry-based fusion, TransFuser reduces the average collisions per kilometer by 48%.

Results

TaskDatasetMetricValueModel
Autonomous VehiclesCARLA LeaderboardDriving Score61.181TransFuser
Autonomous VehiclesCARLA LeaderboardInfraction penalty0.714TransFuser
Autonomous VehiclesCARLA LeaderboardRoute Completion86.694TransFuser
Autonomous VehiclesCARLA LeaderboardDriving Score55.04TransFuser (Reproduced)
Autonomous VehiclesCARLA LeaderboardInfraction penalty0.63TransFuser (Reproduced)
Autonomous VehiclesCARLA LeaderboardRoute Completion89.65TransFuser (Reproduced)
Autonomous VehiclesCARLA LeaderboardDriving Score45.2Latent TransFuser
Autonomous VehiclesCARLA LeaderboardInfraction penalty0.72Latent TransFuser
Autonomous VehiclesCARLA LeaderboardRoute Completion66.31Latent TransFuser
Autonomous VehiclesOpenScenePDMS84TransFuser
Autonomous VehiclesOpenScenePDMS83.8Latent TransFuser
Autonomous DrivingCARLA LeaderboardDriving Score61.181TransFuser
Autonomous DrivingCARLA LeaderboardInfraction penalty0.714TransFuser
Autonomous DrivingCARLA LeaderboardRoute Completion86.694TransFuser
Autonomous DrivingCARLA LeaderboardDriving Score55.04TransFuser (Reproduced)
Autonomous DrivingCARLA LeaderboardInfraction penalty0.63TransFuser (Reproduced)
Autonomous DrivingCARLA LeaderboardRoute Completion89.65TransFuser (Reproduced)
Autonomous DrivingCARLA LeaderboardDriving Score45.2Latent TransFuser
Autonomous DrivingCARLA LeaderboardInfraction penalty0.72Latent TransFuser
Autonomous DrivingCARLA LeaderboardRoute Completion66.31Latent TransFuser
Autonomous DrivingOpenScenePDMS84TransFuser
Autonomous DrivingOpenScenePDMS83.8Latent TransFuser
CARLA longest6CARLADriving Score47TransFuser (TF)
CARLA longest6CARLAInfraction Score0.5TransFuser (TF)
CARLA longest6CARLARoute Completion93TransFuser (TF)
CARLA longest6CARLADriving Score37Latent TransFuser (LTF)
CARLA longest6CARLAInfraction Score0.38Latent TransFuser (LTF)
CARLA longest6CARLARoute Completion95Latent TransFuser (LTF)
CARLA longest6CARLADriving Score27Geometric Fusion (GF)
CARLA longest6CARLAInfraction Score0.3Geometric Fusion (GF)
CARLA longest6CARLARoute Completion91Geometric Fusion (GF)
CARLA longest6CARLADriving Score22Late Fusion (LF)
CARLA longest6CARLAInfraction Score0.27Late Fusion (LF)
CARLA longest6CARLARoute Completion83Late Fusion (LF)

Related Papers

GEMINUS: Dual-aware Global and Scene-Adaptive Mixture-of-Experts for End-to-End Autonomous Driving2025-07-19AGENTS-LLM: Augmentative GENeration of Challenging Traffic Scenarios with an Agentic LLM Framework2025-07-18The Imitation Game: Turing Machine Imitator is Length Generalizable Reasoner2025-07-17Supervised Fine Tuning on Curated Data is Reinforcement Learning (and can be improved)2025-07-17World Model-Based End-to-End Scene Generation for Accident Anticipation in Autonomous Driving2025-07-17Orbis: Overcoming Challenges of Long-Horizon Prediction in Driving World Models2025-07-17Channel-wise Motion Features for Efficient Motion Segmentation2025-07-17LaViPlan : Language-Guided Visual Path Planning with RLVR2025-07-17