Guided Cost Learning: Deep Inverse Optimal Control via Policy Optimization

03/01/2016
by   Chelsea Finn, et al.
0

Reinforcement learning can acquire complex behaviors from high-level specifications. However, defining a cost function that can be optimized effectively and encodes the correct task is challenging in practice. We explore how inverse optimal control (IOC) can be used to learn behaviors from demonstrations, with applications to torque control of high-dimensional robotic systems. Our method addresses two key challenges in inverse optimal control: first, the need for informative features and effective regularization to impose structure on the cost, and second, the difficulty of learning the cost function under unknown dynamics for high-dimensional continuous systems. To address the former challenge, we present an algorithm capable of learning arbitrary nonlinear cost functions, such as neural networks, without meticulous feature engineering. To address the latter challenge, we formulate an efficient sample-based approximation for MaxEnt IOC. We evaluate our method on a series of simulated tasks and real-world robotic manipulation problems, demonstrating substantial improvement over prior methods both in terms of task complexity and sample efficiency.

READ FULL TEXT

page 1

page 6

page 7

research
06/18/2012

Continuous Inverse Optimal Control with Locally Optimal Examples

Inverse optimal control, also known as inverse reinforcement learning, i...
research
05/22/2018

Learning to Optimize via Wasserstein Deep Inverse Optimal Control

We study the inverse optimal control problem in social sciences: we aim ...
research
04/10/2019

Learning Trajectory Prediction with Continuous Inverse Optimal Control via Langevin Sampling of Energy-Based Models

Autonomous driving is a challenging multiagent domain which requires opt...
research
10/18/2020

Model-Based Inverse Reinforcement Learning from Visual Demonstrations

Scaling model-based inverse reinforcement learning (IRL) to real robotic...
research
06/09/2022

Receding Horizon Inverse Reinforcement Learning

Inverse reinforcement learning (IRL) seeks to infer a cost function that...
research
03/21/2018

Inverse Optimal Control with Incomplete Observations

In this article, we consider the inverse optimal control problem given i...
research
05/07/2019

Optimal Control of Complex Systems through Variational Inference with a Discrete Event Decision Process

Complex social systems are composed of interconnected individuals whose ...

Please sign up or login with your details

Forgot password? Click here to reset