Learning convex bounds for linear quadratic control policy synthesis

06/01/2018
by Jack Umenberger, et al.

Learning to make decisions from observed data in dynamic environments remains a problem of fundamental importance in a number of fields, from artificial intelligence and robotics to medicine and finance. This paper concerns the problem of learning control policies for unknown linear dynamical systems so as to maximize a quadratic reward function. We present a method to optimize the expected value of the reward over the posterior distribution of the unknown system parameters, given data. The algorithm involves sequential convex programming and enjoys reliable local convergence and robust stability guarantees. Numerical simulations and stabilization of a real-world inverted pendulum are used to demonstrate the approach, with strong performance and robustness properties observed in both.
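To make the objective concrete, the following is a minimal sketch of a Monte Carlo estimate of the expected quadratic cost (the negative of the reward) of a fixed static-gain policy u = K x, averaged over sampled models. It is not the sequential convex programming algorithm of the paper; it only illustrates the quantity being optimized. The function names, the unit-covariance process noise, the infinite-horizon average-cost formulation, and the example samples are all assumptions made for illustration.

```python
import numpy as np

def lq_cost(A, B, K, Q, R):
    """Average cost of u = K x for x+ = A x + B u + w, w ~ N(0, I) (assumed noise model)."""
    Acl = A + B @ K  # closed-loop dynamics
    # An unstable closed loop has unbounded cost.
    if np.max(np.abs(np.linalg.eigvals(Acl))) >= 1.0:
        return np.inf
    n = A.shape[0]
    Qk = Q + K.T @ R @ K  # per-step cost matrix under the policy
    # Solve the discrete Lyapunov equation P = Qk + Acl^T P Acl by vectorization.
    M = np.eye(n * n) - np.kron(Acl.T, Acl.T)
    P = np.linalg.solve(M, Qk.reshape(-1)).reshape(n, n)
    return float(np.trace(P))  # trace(P W) with W = I

def expected_cost(samples, K, Q, R):
    """Mean cost over (A, B) pairs, e.g. draws from a posterior (hypothetical here)."""
    return float(np.mean([lq_cost(A, B, K, Q, R) for A, B in samples]))

# Hypothetical scalar example: two sampled models and the zero gain.
samples = [(np.array([[0.5]]), np.array([[1.0]])),
           (np.array([[0.0]]), np.array([[1.0]]))]
Q = np.array([[1.0]])
R = np.array([[1.0]])
K = np.zeros((1, 1))
estimate = expected_cost(samples, K, Q, R)
```

Because a single destabilized sample makes the average infinite, an objective of this form penalizes any gain that fails to stabilize every sampled model, which gives some intuition for why averaging over the posterior encourages robustness.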

