LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

Song, Chan Hee; Sadler, Brian M.; Wu, Jiaman; Chao, Wei‐Lun; Washington, Clayton; Su, Yu

doi:10.1109/iccv51070.2023.00280

articleOct 1, 2023Closed access

LLM-Planner: Few-Shot Grounded Planning for Embodied Agents with Large Language Models

CHChan Hee Song BMBrian M. Sadler JWJiaman Wu WCWei‐Lun Chao CWClayton Washington

The Ohio State University · DevCom (Czechia)

Indexed incrossref

Abstract

This study focuses on using large language models (LLMs) as a planner for embodied agents that can follow natural language instructions to complete complex tasks in a visually-perceived environment. The high data cost and poor sample efficiency of existing methods hinders the development of versatile agents that are capable of many tasks and can learn new tasks quickly. In this work, we propose a novel method, LLM-Planner, that harnesses the power of large language models to do few-shot planning for embodied agents. We further propose a simple but effective way to enhance LLMs with physical grounding to generate and update plans that are grounded in the current environment. Experiments on the ALFRED dataset…

Citation impact

345

total citations

FWCI: 39.24
Percentile: 100%
References: 30

Citations per year

Authors

6

Topics & keywords

Topics

Keywords

Embodied cognition
Planner
Computer science
Task (project management)
Shot (pellet)
Sample (material)
Natural language generation
Artificial intelligence

No related works found for this paper.