Safe Learning of Lifted Action Models

07/09/2021
by   Brendan Juba, et al.
0

Creating a domain model, even for classical, domain-independent planning, is a notoriously hard knowledge-engineering task. A natural approach to solve this problem is to learn a domain model from observations. However, model learning approaches frequently do not provide safety guarantees: the learned model may assume actions are applicable when they are not, and may incorrectly capture actions' effects. This may result in generating plans that will fail when executed. In some domains such failures are not acceptable, due to the cost of failure or inability to replan online after failure. In such settings, all learning must be done offline, based on some observations collected, e.g., by some other agents or a human. Through this learning, the task is to generate a plan that is guaranteed to be successful. This is called the model-free planning problem. Prior work proposed an algorithm for solving the model-free planning problem in classical planning. However, they were limited to learning grounded domains, and thus they could not scale. We generalize this prior work and propose the first safe model-free planning algorithm for lifted domains. We prove the correctness of our approach, and provide a statistical analysis showing that the number of trajectories needed to solve future problems with high probability is linear in the potential size of the domain model. We also present experiments on twelve IPC domains showing that our approach is able to learn the real action model in all cases with at most two trajectories.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2017

Efficient, Safe, and Probably Approximately Complete Learning of Action Models

In this paper we explore the theoretical boundaries of planning in a set...
research
07/19/2019

Representation Learning for Classical Planning from Partially Observed Traces

Specifying a complete domain model is time-consuming, which has been a b...
research
12/28/2018

Dynamic Planning Networks

We introduce Dynamic Planning Networks (DPN), a novel architecture for d...
research
09/28/2021

Competence-Aware Path Planning via Introspective Perception

Robots deployed in the real world over extended periods of time need to ...
research
10/07/2019

A Learnable Safety Measure

Failures are challenging for learning to control physical systems since ...
research
10/25/2021

Online Strategy Synthesis for Safe and Optimized Control of Steerable Needles

Autonomous systems are often applied in uncertain environments, which re...
research
11/26/2020

AMLSI: A Novel Accurate Action Model Learning Algorithm

This paper presents new approach based on grammar induction called AMLSI...

Please sign up or login with your details

Forgot password? Click here to reset