SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning

07/12/2023
by Krishan Rana, et al.
Large language models (LLMs) have demonstrated impressive results in developing generalist planning agents for diverse tasks. However, grounding these plans in expansive, multi-floor, and multi-room environments presents a significant challenge for robotics. We introduce SayPlan, a scalable approach to LLM-based, large-scale task planning for robotics using 3D scene graph (3DSG) representations. To ensure the scalability of our approach, we: (1) exploit the hierarchical nature of 3DSGs to allow LLMs to conduct a semantic search for task-relevant subgraphs from a smaller, collapsed representation of the full graph; (2) reduce the planning horizon for the LLM by integrating a classical path planner; and (3) introduce an iterative replanning pipeline that refines the initial plan using feedback from a scene graph simulator, correcting infeasible actions and avoiding planning failures. We evaluate our approach on two large-scale environments spanning up to 3 floors, 36 rooms and 140 objects, and show that our approach is capable of grounding large-scale, long-horizon task plans from abstract, natural language instructions for a mobile manipulator robot to execute.
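To make points (1) and (2) concrete, the following is a minimal sketch and not the authors' implementation. It assumes a toy floor/room/object hierarchy; the `SceneGraph` class, its `expand`/`contract`/`visible` methods, and the `shortest_path` helper are all hypothetical names introduced for illustration. A collapsed graph exposes only high-level nodes, a node is expanded on request (the role the LLM's semantic search would play), and room-to-room navigation is delegated to a classical planner, here plain breadth-first search.

```python
from collections import deque
from dataclasses import dataclass, field


@dataclass
class SceneGraph:
    """Toy hierarchical scene graph (hypothetical layout, not the paper's schema)."""
    children: dict = field(default_factory=dict)  # node -> list of child nodes
    edges: dict = field(default_factory=dict)     # room -> list of adjacent rooms
    expanded: set = field(default_factory=set)    # nodes whose children are visible

    def add(self, parent, child):
        self.children.setdefault(parent, []).append(child)
        self.children.setdefault(child, [])

    def connect(self, a, b):
        self.edges.setdefault(a, []).append(b)
        self.edges.setdefault(b, []).append(a)

    def expand(self, node):
        self.expanded.add(node)

    def contract(self, node):
        self.expanded.discard(node)

    def visible(self, root):
        """Depth-first listing that hides the children of collapsed nodes."""
        out, stack = [], [root]
        while stack:
            node = stack.pop()
            out.append(node)
            if node in self.expanded:
                stack.extend(reversed(self.children[node]))
        return out


def shortest_path(graph, start, goal):
    """Breadth-first search over room adjacency; returns a room list or None."""
    parents, queue = {start: None}, deque([start])
    while queue:
        room = queue.popleft()
        if room == goal:
            path = []
            while room is not None:
                path.append(room)
                room = parents[room]
            return path[::-1]
        for nxt in graph.edges.get(room, []):
            if nxt not in parents:
                parents[nxt] = room
                queue.append(nxt)
    return None


g = SceneGraph()
g.add("building", "floor_1")
for room in ("kitchen", "hallway", "office"):
    g.add("floor_1", room)
g.add("kitchen", "mug")
g.connect("kitchen", "hallway")
g.connect("hallway", "office")

g.expand("building")                    # collapsed view: floors only
collapsed_view = g.visible("building")
g.expand("floor_1")                     # search step: open the relevant floor
g.expand("kitchen")                     # ...and the room likely to hold the mug
expanded_view = g.visible("building")
route = shortest_path(g, "office", "kitchen")
```

In this sketch the planner only ever sees the output of `visible`, so the size of the prompt-facing graph depends on how many nodes have been expanded rather than on the size of the building. Offloading `shortest_path` to a classical search means the language model would need to name only the destination, not each intermediate room.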

