DriveGPT4: Interpretable End-to-End Autonomous Driving Via Large Language Model
University of Hong Kong · Zhejiang University · +2 more institutions
Abstract
Multimodallarge language models (MLLMs) have emerged as a prominent area of interest within the research community, given their proficiency in handling and reasoning with non-textual data, including images and videos. This study seeks to extend the application of MLLMs to the realm of autonomous driving by introducing DriveGPT4, a novel interpretable end-to-end autonomous driving system based on LLMs. Capable of processing multi-frame video inputs and textual queries, DriveGPT4 facilitates the interpretation of vehicle actions, offers pertinent reasoning, and effectively addresses a diverse range of questions posed by users. Furthermore, DriveGPT4 predicts low-level vehicle control signals in an end-to-end…
Citation impact
- FWCI
- 70.29
- Percentile
- 100%
- References
- 68
Authors
8Topics & keywords
- End-to-end principle
- Computer science
- Language model
- End of history
- Artificial intelligence
- Political science