Primal-dual Learning for the Model-free Risk-constrained Linear Quadratic Regulator

11/22/2020
by   Feiran Zhao, et al.
0

Risk-aware control, though with promise to tackle unexpected events, requires a known exact dynamical model. In this work, we propose a model-free framework to learn a risk-aware controller with a focus on the linear system. We formulate it as a discrete-time infinite-horizon LQR problem with a state predictive variance constraint. To solve it, we parameterize the policy with a feedback gain pair and leverage primal-dual methods to optimize it by solely using data. We first study the optimization landscape of the Lagrangian function and establish the strong duality in spite of its non-convex nature. Alongside, we find that the Lagrangian function enjoys an important local gradient dominance property, which is then exploited to develop a convergent random search algorithm to learn the dual function. Furthermore, we propose a primal-dual algorithm with global convergence to learn the optimal policy-multiplier pair. Finally, we validate our results via simulations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/20/2023

Last-Iterate Convergent Policy Gradient Primal-Dual Methods for Constrained MDPs

We study the problem of computing an optimal policy of an infinite-horiz...
research
06/13/2023

A Primal-Dual-Critic Algorithm for Offline Constrained Reinforcement Learning

Offline constrained reinforcement learning (RL) aims to learn a policy t...
research
02/13/2022

Reinforcement Learning Based Power Control for Reliable Wireless Transmission

In this paper, we investigate a sequential power allocation problem over...
research
11/10/2019

Model-Free Learning of Optimal Ergodic Policies in Wireless Systems

Learning optimal resource allocation policies in wireless systems can be...
research
09/05/2017

A second order primal-dual method for nonsmooth convex composite optimization

We develop a second order primal-dual method for optimization problems i...
research
02/09/2016

Large scale multi-objective optimization: Theoretical and practical challenges

Multi-objective optimization (MOO) is a well-studied problem for several...
research
06/13/2018

On Landscape of Lagrangian Functions and Stochastic Search for Constrained Nonconvex Optimization

We study constrained nonconvex optimization problems in machine learning...

Please sign up or login with your details

Forgot password? Click here to reset