TransFuser: Imitation With Transformer-Based Sensor Fusion for Autonomous Driving

TH Bingen University of Applied Sciences · Max Planck Institute for Intelligent Systems · +1 more institution

PubMed
Indexed incrossrefpubmed

Abstract

How should we integrate representations from complementary sensors for autonomous driving? Geometry-based fusion has shown promise for perception (e.g., object detection, motion forecasting). However, in the context of end-to-end driving, we find that imitation learning based on existing sensor fusion methods underperforms in complex driving scenarios with a high density of dynamic agents. Therefore, we propose TransFuser, a mechanism to integrate image and LiDAR representations using self-attention. Our approach uses transformer modules at multiple resolutions to fuse perspective view and bird's eye view feature maps. We experimentally validate its efficacy on a challenging new benchmark with long routes and…

No related works found for this paper.

Funding