Kashyap Chitta, Aditya Prakash, Bernhard Jaeger, Zehao Yu, Katrin Renz, Andreas Geiger
How should we integrate representations from complementary sensors for autonomous driving? Geometry-based fusion has shown promise for perception (e.g., object detection, motion forecasting). However, in the context of end-to-end driving, we find that imitation learning based on existing sensor fusion methods underperforms in complex driving scenarios with a high density of dynamic agents. Therefore, we propose TransFuser, a mechanism to integrate image and LiDAR representations using self-attention. Our approach uses transformer modules at multiple resolutions to fuse perspective-view and bird's-eye-view feature maps. We experimentally validate its efficacy on a challenging new benchmark with long routes and dense traffic, as well as on the official leaderboard of the CARLA urban driving simulator. At the time of submission, TransFuser outperforms all prior work on the CARLA leaderboard in terms of driving score by a large margin. Compared to geometry-based fusion, TransFuser reduces the average collisions per kilometer by 48%.
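To make the fusion idea concrete, the following is a minimal sketch of attention-based fusion at a single resolution, not the authors' implementation. It assumes each feature map has already been flattened into a list of token vectors of equal dimension, uses one attention head with identity query/key/value projections, and adds the attended features back to the inputs. Learned projections, positional embeddings, multiple heads, and the repetition across resolutions are omitted, and the names `self_attention` and `fuse` are hypothetical.

```python
import math


def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]


def self_attention(tokens):
    """Single-head scaled dot-product self-attention.

    Assumption: identity Q/K/V projections, so this only illustrates
    the data flow, not a trained transformer layer.
    """
    d = len(tokens[0])
    out = []
    for q in tokens:
        # Similarity of this token to every token from both sensors.
        scores = [sum(a * b for a, b in zip(q, k)) / math.sqrt(d) for k in tokens]
        weights = softmax(scores)
        # Weighted average of all tokens.
        out.append([sum(w * v[j] for w, v in zip(weights, tokens)) for j in range(d)])
    return out


def fuse(image_tokens, lidar_tokens):
    """Attend jointly over both token sets, then split them again.

    Assumption: the attended features are added residually to the inputs.
    """
    joint = image_tokens + lidar_tokens
    attended = self_attention(joint)
    fused = [[x + y for x, y in zip(t, a)] for t, a in zip(joint, attended)]
    n = len(image_tokens)
    return fused[:n], fused[n:]


# Toy tokens: two from the perspective-view image, one from the LiDAR BEV map.
image_tokens = [[1.0, 0.0], [0.0, 1.0]]
lidar_tokens = [[1.0, 1.0]]
fused_image, fused_lidar = fuse(image_tokens, lidar_tokens)
print(len(fused_image), len(fused_lidar))
```

Because every token attends to tokens from both sensors, each fused image token depends on the LiDAR input and vice versa, which is the property that distinguishes this style of fusion from processing the two branches independently.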
| Task | Dataset | Metric | Value | Model |
|---|---|---|---|---|
| Autonomous Vehicles | CARLA Leaderboard | Driving Score | 61.181 | TransFuser |
| Autonomous Vehicles | CARLA Leaderboard | Infraction penalty | 0.714 | TransFuser |
| Autonomous Vehicles | CARLA Leaderboard | Route Completion | 86.694 | TransFuser |
| Autonomous Vehicles | CARLA Leaderboard | Driving Score | 55.04 | TransFuser (Reproduced) |
| Autonomous Vehicles | CARLA Leaderboard | Infraction penalty | 0.63 | TransFuser (Reproduced) |
| Autonomous Vehicles | CARLA Leaderboard | Route Completion | 89.65 | TransFuser (Reproduced) |
| Autonomous Vehicles | CARLA Leaderboard | Driving Score | 45.2 | Latent TransFuser |
| Autonomous Vehicles | CARLA Leaderboard | Infraction penalty | 0.72 | Latent TransFuser |
| Autonomous Vehicles | CARLA Leaderboard | Route Completion | 66.31 | Latent TransFuser |
| Autonomous Vehicles | OpenScene | PDMS | 84 | TransFuser |
| Autonomous Vehicles | OpenScene | PDMS | 83.8 | Latent TransFuser |
| CARLA longest6 | CARLA | Driving Score | 47 | TransFuser (TF) |
| CARLA longest6 | CARLA | Infraction Score | 0.5 | TransFuser (TF) |
| CARLA longest6 | CARLA | Route Completion | 93 | TransFuser (TF) |
| CARLA longest6 | CARLA | Driving Score | 37 | Latent TransFuser (LTF) |
| CARLA longest6 | CARLA | Infraction Score | 0.38 | Latent TransFuser (LTF) |
| CARLA longest6 | CARLA | Route Completion | 95 | Latent TransFuser (LTF) |
| CARLA longest6 | CARLA | Driving Score | 27 | Geometric Fusion (GF) |
| CARLA longest6 | CARLA | Infraction Score | 0.3 | Geometric Fusion (GF) |
| CARLA longest6 | CARLA | Route Completion | 91 | Geometric Fusion (GF) |
| CARLA longest6 | CARLA | Driving Score | 22 | Late Fusion (LF) |
| CARLA longest6 | CARLA | Infraction Score | 0.27 | Late Fusion (LF) |
| CARLA longest6 | CARLA | Route Completion | 83 | Late Fusion (LF) |
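
A note on reading the table: on the CARLA Leaderboard, the driving score of a route is the product of its route completion and its infraction penalty, and the reported figures are averages over routes. The reported Driving Score therefore need not equal the product of the reported Route Completion and Infraction penalty (for TransFuser, 86.694 × 0.714 ≈ 61.90, whereas the table lists 61.181). The sketch below illustrates this with hypothetical per-route values; the function name `aggregate` and the numbers are assumptions for illustration, not results from the paper.

```python
def aggregate(routes):
    """Average per-route metrics.

    routes: list of (route_completion_percent, infraction_penalty) pairs.
    Returns (driving_score, mean_route_completion, mean_infraction_penalty).
    """
    n = len(routes)
    mean_rc = sum(rc for rc, _ in routes) / n
    mean_ip = sum(ip for _, ip in routes) / n
    # Driving score is computed per route first, then averaged.
    driving_score = sum(rc * ip for rc, ip in routes) / n
    return driving_score, mean_rc, mean_ip


# Hypothetical per-route results (not taken from the paper).
routes = [(100.0, 1.0), (100.0, 0.5), (40.0, 0.6)]
ds, rc, ip = aggregate(routes)
print(round(ds, 2), round(rc, 2), round(ip, 2), round(rc * ip, 2))
```

For these made-up routes, the mean of the per-route products is 58 while the product of the means is 56, which mirrors the small gap visible in the table rows.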