articleIEEE Robotics and Automation LettersAug 7, 2024GREEN OA

DriveGPT4: Interpretable End-to-End Autonomous Driving Via Large Language Model

University of Hong Kong · Zhejiang University · +2 more institutions

Indexed incrossref

Abstract

Multimodallarge language models (MLLMs) have emerged as a prominent area of interest within the research community, given their proficiency in handling and reasoning with non-textual data, including images and videos. This study seeks to extend the application of MLLMs to the realm of autonomous driving by introducing DriveGPT4, a novel interpretable end-to-end autonomous driving system based on LLMs. Capable of processing multi-frame video inputs and textual queries, DriveGPT4 facilitates the interpretation of vehicle actions, offers pertinent reasoning, and effectively addresses a diverse range of questions posed by users. Furthermore, DriveGPT4 predicts low-level vehicle control signals in an end-to-end…

Citation impact

311
total citations
FWCI
70.29
Percentile
100%
References
68
Citations per year

Authors

8

Topics & keywords

Keywords
  • End-to-end principle
  • Computer science
  • Language model
  • End of history
  • Artificial intelligence
  • Political science
No related works found for this paper.

Funding