SayCanPay: Heuristic Planning with Large Language Models using Learnable Domain Knowledge

08/24/2023
by   Rishi Hazra, et al.
0

Large Language Models (LLMs) have demonstrated impressive planning abilities due to their vast "world knowledge". Yet, obtaining plans that are both feasible (grounded in affordances) and cost-effective (in plan length), remains a challenge, despite recent progress. This contrasts with heuristic planning methods that employ domain knowledge (formalized in action models such as PDDL) and heuristic search to generate feasible, optimal plans. Inspired by this, we propose to combine the power of LLMs and heuristic planning by leveraging the world knowledge of LLMs and the principles of heuristic search. Our approach, SayCanPay, employs LLMs to generate actions (Say) guided by learnable domain knowledge, that evaluates actions' feasibility (Can) and long-term reward/payoff (Pay), and heuristic search to select the best sequence of actions. Our contributions are (1) a novel framing of the LLM planning problem in the context of heuristic planning, (2) integrating grounding and cost-effective elements into the generated plans, and (3) using heuristic search over actions. Our extensive evaluations show that our model surpasses other LLM planning approaches.

READ FULL TEXT

page 2

page 12

page 13

research
05/25/2023

On the Planning Abilities of Large Language Models – A Critical Investigation

Intrigued by the claims of emergent reasoning capabilities in LLMs train...
research
07/12/2023

SayPlan: Grounding Large Language Models using 3D Scene Graphs for Scalable Task Planning

Large language models (LLMs) have demonstrated impressive results in dev...
research
01/23/2014

Online Speedup Learning for Optimal Planning

Domain-independent planning is one of the foundational areas in the fiel...
research
04/22/2023

LLM+P: Empowering Large Language Models with Optimal Planning Proficiency

Large language models (LLMs) have demonstrated remarkable zero-shot gene...
research
05/26/2023

Learning and Leveraging Verifiers to Improve Planning Capabilities of Pre-trained Language Models

There have been wide spread claims in the literature about the emergent ...
research
01/27/2020

Long term planning of military aircraft flight and maintenance operations

We present the Flight and Maintenance Planning (FMP) problem in its mili...
research
12/19/2022

Don't Generate, Discriminate: A Proposal for Grounding Language Models to Real-World Environments

A key missing ability of current language models (LMs) is grounding to r...

Please sign up or login with your details

Forgot password? Click here to reset