Imitation Learning from Nonlinear MPC via the Exact Q-Loss and its Gauss-Newton Approximation

04/03/2023
by   Andrea Ghezzi, et al.
0

This work presents a novel loss function for learning nonlinear Model Predictive Control policies via Imitation Learning. Standard approaches to Imitation Learning neglect information about the expert and generally adopt a loss function based on the distance between expert and learned controls. In this work, we present a loss based on the Q-function directly embedding the performance objectives and constraint satisfaction of the associated Optimal Control Problem (OCP). However, training a Neural Network with the Q-loss requires solving the associated OCP for each new sample. To alleviate the computational burden, we derive a second Q-loss based on the Gauss-Newton approximation of the OCP resulting in a faster training time. We validate our losses against Behavioral Cloning, the standard approach to Imitation Learning, on the control of a nonlinear system with constraints. The final results show that the Q-function-based losses significantly reduce the amount of constraint violations while achieving comparable or better closed-loop costs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2022

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a t...
research
05/19/2022

Learning Energy Networks with Generalized Fenchel-Young Losses

Energy-based models, a.k.a. energy networks, perform inference by optimi...
research
09/11/2019

MPC-Net: A First Principles Guided Policy Search

We present an Imitation Learning approach for the control of dynamical s...
research
06/08/2022

Constrained Imitation Learning for a Flapping Wing Unmanned Aerial Vehicle

This paper presents a data-driven optimal control policy for a micro fla...
research
11/13/2015

Neuroprosthetic decoder training as imitation learning

Neuroprosthetic brain-computer interfaces function via an algorithm whic...
research
03/26/2021

Imitation Learning from MPC for Quadrupedal Multi-Gait Control

We present a learning algorithm for training a single policy that imitat...
research
04/04/2023

Quantum Imitation Learning

Despite remarkable successes in solving various complex decision-making ...

Please sign up or login with your details

Forgot password? Click here to reset