Learning Domain-Independent Planning Heuristics with Hypergraph Networks

11/29/2019
by   William Shen, et al.
0

We present the first approach capable of learning domain-independent planning heuristics entirely from scratch. The heuristics we learn map the hypergraph representation of the delete-relaxation of the planning problem at hand, to a cost estimate that approximates that of the least-cost path from the current state to the goal through the hypergraph. We generalise Graph Networks to obtain a new framework for learning over hypergraphs, which we specialise to learn planning heuristics by training over state/value pairs obtained from optimal cost plans. Our experiments show that the resulting architecture, STRIPS-HGNs, is capable of learning heuristics that are competitive with existing delete-relaxation heuristics including LM-cut. We show that the heuristics we learn are able to generalise across different problems and domains, including to domains that were not seen during training.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/06/2019

A novel approach to model exploration for value function learning

Planning and Learning are complementary approaches. Planning relies on d...
research
07/21/2017

Towards learning domain-independent planning heuristics

Automated planning remains one of the most general paradigms in Artifici...
research
10/18/2021

Goal Agnostic Planning using Maximum Likelihood Paths in Hypergraph World Models

In this paper, we present a hypergraph–based machine learning algorithm,...
research
11/22/2017

A Generic Framework for Engineering Graph Canonization Algorithms

The state-of-the-art tools for practical graph canonization are all base...
research
07/10/2020

Learning Generalized Relational Heuristic Networks for Model-Agnostic Planning

Computing goal-directed behavior (sequential decision-making, or plannin...
research
07/25/2023

On Solving the Rubik's Cube with Domain-Independent Planners Using Standard Representations

Rubik's Cube (RC) is a well-known and computationally challenging puzzle...
research
07/04/2012

Cost Sensitive Reachability Heuristics for Handling State Uncertainty

While POMDPs provide a general platform for non-deterministic conditiona...

Please sign up or login with your details

Forgot password? Click here to reset