Learning Constraints from Demonstrations

12/17/2018
by   Glen Chou, et al.
6

We extend the learning from demonstration paradigm by providing a method for learning unknown constraints shared across tasks, using demonstrations of the tasks, their cost functions, and knowledge of the system dynamics and control constraints. Given safe demonstrations, our method uses hit-and-run sampling to obtain lower cost, and thus unsafe, trajectories. Both safe and unsafe trajectories are used to obtain a consistent representation of the unsafe set via solving an integer program. Our method generalizes across system dynamics and learns a guaranteed subset of the constraint. We also provide theoretical analysis on what subset of the constraint can be learnable from safe demonstrations. We demonstrate our method on linear and nonlinear system dynamics, show that it can be modified to work with suboptimal demonstrations, and that it can also be used to learn constraints in a feature space.

READ FULL TEXT

page 14

page 15

page 19

page 21

research
10/08/2019

Learning Parametric Constraints in High Dimensions from Demonstrations

We present a scalable algorithm for learning parametric constraints in h...
research
05/25/2023

Learning Safety Constraints from Demonstrations with Unknown Rewards

We propose Convex Constraint Learning for Reinforcement Learning (CoCoRL...
research
04/06/2023

Constraint Inference in Control Tasks from Expert Demonstrations via Inverse Optimization

Inferring unknown constraints is a challenging and crucial problem in ma...
research
09/10/2021

Discretizing Dynamics for Maximum Likelihood Constraint Inference

Maximum likelihood constraint inference is a powerful technique for iden...
research
08/10/2021

Recognizing Orientation Slip in Human Demonstrations

Manipulations of a constrained object often use a non-rigid grasp that a...
research
12/08/2021

Gaussian Process Constraint Learning for Scalable Chance-Constrained Motion Planning from Demonstrations

We propose a method for learning constraints represented as Gaussian pro...
research
06/03/2020

Explaining Multi-stage Tasks by Learning Temporal Logic Formulas from Suboptimal Demonstrations

We present a method for learning multi-stage tasks from demonstrations b...

Please sign up or login with your details

Forgot password? Click here to reset