Demonstration Informed Specification Search

This paper considers the problem of learning history dependent task specifications, e.g. automata and temporal logic, from expert demonstrations. Unfortunately, the (countably infinite) number of tasks under consideration combined with an a-priori ignorance of what historical features are needed to encode the demonstrated task makes existing approaches to learning tasks from demonstrations inapplicable. To address this deficit, we propose Demonstration Informed Specification Search (DISS): a family of algorithms parameterized by black box access to (i) a maximum entropy planner and (ii) an algorithm for identifying concepts, e.g., automata, from labeled examples. DISS works by alternating between (i) conjecturing labeled examples to make the demonstrations less surprising and (ii) sampling concepts consistent with the current labeled examples. In the context of tasks described by deterministic finite automata, we provide a concrete implementation of DISS that efficiently combines partial knowledge of the task and a single expert demonstration to identify the full task specification.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/03/2020

Elaborating on Learned Demonstrations with Temporal Logic Specifications

Most current methods for learning from demonstrations assume that those ...
research
01/04/2021

Robust Maximum Entropy Behavior Cloning

Imitation learning (IL) algorithms use expert demonstrations to learn a ...
research
09/07/2022

Optimizing Demonstrated Robot Manipulation Skills for Temporal Logic Constraints

For performing robotic manipulation tasks, the core problem is determini...
research
07/26/2019

Learning Task Specifications from Demonstrations via the Principle of Maximum Causal Entropy

In many settings (e.g., robotics) demonstrations provide a natural way t...
research
03/09/2022

Learning to control from expert demonstrations

In this paper, we revisit the problem of learning a stabilizing controll...
research
05/04/2016

A Bayesian Approach to Policy Recognition and State Representation Learning

Learning from demonstration (LfD) is the process of building behavioral ...
research
04/14/2022

Synthesizing Analytical SQL Queries from Computation Demonstration

Analytical SQL is widely used in modern database applications and data a...

Please sign up or login with your details

Forgot password? Click here to reset