ASNets: Deep Learning for Generalised Planning

08/04/2019
by   Sam Toyer, et al.
0

In this paper, we discuss the learning of generalised policies for probabilistic and classical planning problems using Action Schema Networks (ASNets). The ASNet is a neural network architecture that exploits the relational structure of (P)PDDL planning problems to learn a common set of weights that can be applied to any problem in a domain. By mimicking the actions chosen by a traditional, non-learning planner on a handful of small problems in a domain, ASNets are able to learn a generalised reactive policy that can quickly solve much larger instances from the domain. This work extends the ASNet architecture to make it more expressive, while still remaining invariant to a range of symmetries that exist in PPDDL problems. We also present a thorough experimental evaluation of ASNets, including a comparison with heuristic search planners on seven probabilistic and deterministic domains, an extended evaluation on over 18,000 Blocksworld instances, and an ablation study. Finally, we show that sparsity-inducing regularisation can produce ASNets that are compact enough for humans to understand, yielding insights into how the structure of ASNets allows them to generalise across a domain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/13/2017

Action Schema Networks: Generalised Policies with Deep Learning

In this paper, we introduce the Action Schema Network (ASNet): a neural ...
research
08/24/2017

Learning Generalized Reactive Policies using Deep Neural Networks

We consider the problem of learning for planning, where knowledge acquir...
research
01/16/2014

Scaling up Heuristic Planning with Relational Decision Trees

Current evaluation functions for heuristic planning are expensive to com...
research
05/21/2017

Generalizing the Role of Determinization in Probabilistic Planning

The stochastic shortest path problem (SSP) is a highly expressive model ...
research
05/05/2020

Generalized Planning With Deep Reinforcement Learning

A hallmark of intelligence is the ability to deduce general principles f...
research
02/18/2020

Generalized Neural Policies for Relational MDPs

A Relational Markov Decision Process (RMDP) is a first-order representat...
research
03/28/2022

Learning Sketches for Decomposing Planning Problems into Subproblems of Bounded Width: Extended Version

Recently, sketches have been introduced as a general language for repres...

Please sign up or login with your details

Forgot password? Click here to reset