Optimal Scene Graph Planning with Large Language Model Guidance

09/17/2023
by   Zhirui Dai, et al.
0

Recent advances in metric, semantic, and topological mapping have equipped autonomous robots with semantic concept grounding capabilities to interpret natural language tasks. This work aims to leverage these new capabilities with an efficient task planning algorithm for hierarchical metric-semantic models. We consider a scene graph representation of the environment and utilize a large language model (LLM) to convert a natural language task into a linear temporal logic (LTL) automaton. Our main contribution is to enable optimal hierarchical LTL planning with LLM guidance over scene graphs. To achieve efficiency, we construct a hierarchical planning domain that captures the attributes and connectivity of the scene graph and the task automaton, and provide semantic guidance via an LLM heuristic function. To guarantee optimality, we design an LTL heuristic function that is provably consistent and supplements the potentially inadmissible LLM guidance in multi-heuristic planning. We demonstrate efficient planning of complex natural language tasks in scene graphs of virtualized real environments.

READ FULL TEXT

page 2

page 4

page 6

research
07/12/2023

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning

Large language models (LLMs) have demonstrated impressive results in dev...
research
03/17/2023

LP-SLAM: Language-Perceptive RGB-D SLAM system based on Large Language Model

Simultaneous localization and mapping (SLAM) is a critical technology th...
research
09/13/2019

Scene Graph Parsing by Attention Graph

Scene graph representations, which form a graph of visual object nodes t...
research
07/10/2022

Sequential Manipulation Planning on Scene Graph

We devise a 3D scene graph representation, contact graph+ (cg+), for eff...
research
12/20/2022

Parsel: A Unified Natural Language Framework for Algorithmic Reasoning

Despite recent success in large language model (LLM) reasoning, LLMs sti...
research
07/11/2022

TASKOGRAPHY: Evaluating robot task planning over large 3D scene graphs

3D scene graphs (3DSGs) are an emerging description; unifying symbolic, ...
research
10/22/2019

Language-guided Semantic Mapping and Mobile Manipulation in Partially Observable Environments

Recent advances in data-driven models for grounded language understandin...

Please sign up or login with your details

Forgot password? Click here to reset