Learning Convex Optimization Control Policies

12/19/2019
by   Akshay Agrawal, et al.
0

Many control policies used in various applications determine the input or action by solving a convex optimization problem that depends on the current state and some parameters. Common examples of such convex optimization control policies (COCPs) include the linear quadratic regulator (LQR), convex model predictive control (MPC), and convex control-Lyapunov or approximate dynamic programming (ADP) policies. These types of control policies are tuned by varying the parameters in the optimization problem, such as the LQR weights, to obtain good performance, judged by application-specific metrics. Tuning is often done by hand, or by simple methods such as a crude grid search. In this paper we propose a method to automate this process, by adjusting the parameters using an approximate gradient of the performance metric with respect to the parameters. Our method relies on recently developed methods that can efficiently evaluate the derivative of the solution of a convex optimization problem with respect to its parameters. We illustrate our method on several examples.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/07/2020

Learning Convex Optimization Models

A convex optimization model predicts an output from an input by solving ...
research
09/14/2020

Disease control as an optimization problem

Traditionally, expert epidemiologists devise policies for disease contro...
research
06/21/2018

A convex method for classification of groups of examples

There are many applications where it important to perform well on a set ...
research
09/26/2017

Optimizing PID parameters with machine learning

This paper examines the Evolutionary programming (EP) method for optimiz...
research
06/19/2015

Approximate Inference with the Variational Holder Bound

We introduce the Variational Holder (VH) bound as an alternative to Vari...
research
07/26/2022

Analysis and Design of Quadratic Neural Networks for Regression, Classification, and Lyapunov Control of Dynamical Systems

This paper addresses the analysis and design of quadratic neural network...
research
10/04/2018

Seamless Parametrization with Arbitrarily Prescribed Cones

Seamless global parametrization of surfaces is a key operation in geomet...

Please sign up or login with your details

Forgot password? Click here to reset