TransFuser: Imitation With Transformer-Based Sensor Fusion for Autonomous Driving
TH Bingen University of Applied Sciences · Max Planck Institute for Intelligent Systems · +1 more institution
Abstract
How should we integrate representations from complementary sensors for autonomous driving? Geometry-based fusion has shown promise for perception (e.g., object detection, motion forecasting). However, in the context of end-to-end driving, we find that imitation learning based on existing sensor fusion methods underperforms in complex driving scenarios with a high density of dynamic agents. Therefore, we propose TransFuser, a mechanism to integrate image and LiDAR representations using self-attention. Our approach uses transformer modules at multiple resolutions to fuse perspective view and bird's eye view feature maps. We experimentally validate its efficacy on a challenging new benchmark with long routes and…
Citation impact
- FWCI
- 25.24
- Percentile
- 100%
- References
- 136
Authors
6- KCKashyap ChittaCorresponding
TH Bingen University of Applied Sciences, Max Planck Institute for Intelligent Systems
- APAditya Prakash
TH Bingen University of Applied Sciences, University of Illinois Urbana-Champaign, Max Planck Institute for Intelligent Systems
- BJBernhard Jaeger
TH Bingen University of Applied Sciences, Max Planck Institute for Intelligent Systems
- ZYZehao Yu
TH Bingen University of Applied Sciences, Max Planck Institute for Intelligent Systems
- KRKatrin Renz
TH Bingen University of Applied Sciences, Max Planck Institute for Intelligent Systems
Topics & keywords
- Computer science
- Artificial intelligence
- Computer vision
- Margin (machine learning)
- Lidar
- Object detection
- Sensor fusion
- Transformer
- Sustainable cities and communities