Reinforcement Learning for Adaptive Optimal Stationary Control of Linear Stochastic Systems

07/16/2021
by Bo Pang, et al.

This paper studies the adaptive optimal stationary control of continuous-time linear stochastic systems with both additive and multiplicative noise, using reinforcement learning techniques. Based on policy iteration, it proposes a novel off-policy reinforcement learning algorithm named optimistic least-squares-based policy iteration. Starting from an initial admissible control policy, the algorithm iteratively finds near-optimal policies for the adaptive optimal stationary control problem directly from input/state data, without explicitly identifying any system matrices. Under mild conditions, the solutions it produces are proved to converge to a small neighborhood of the optimal solution with probability one. An application to a triple inverted pendulum example validates the feasibility and effectiveness of the algorithm.
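To make the policy-iteration idea behind the abstract concrete, here is a minimal sketch for a scalar system. It is not the paper's algorithm: it is model-based (it uses the system parameters directly, whereas the paper works from input/state data), and it assumes dynamics of the form dx = (a x + b u) dt + (c x + d u) dw with running cost q x^2 + r u^2. The function name and all parameter names are hypothetical and chosen only for illustration.

```python
def policy_iteration_scalar(a, b, q, r, k0, c=0.0, d=0.0, iters=20):
    """Model-based policy iteration for an assumed scalar stochastic LQ problem.

    Assumed dynamics: dx = (a*x + b*u) dt + (c*x + d*u) dw, policy u = -k*x,
    running cost q*x**2 + r*u**2. This is an illustrative simplification,
    not the data-driven algorithm proposed in the paper.
    """
    k = k0
    p = None
    for _ in range(iters):
        # Closed-loop mean-square growth rate; it must be negative for the
        # current gain to be admissible (mean-square stabilizing).
        rate = 2.0 * (a - b * k) + (c - d * k) ** 2
        if rate >= 0.0:
            raise ValueError("gain is not mean-square stabilizing")
        # Policy evaluation: solve p * rate + q + r*k**2 = 0 for p.
        p = (q + r * k * k) / (-rate)
        # Policy improvement: minimize the Hamiltonian over the gain.
        k = p * (b + c * d) / (r + p * d * d)
    return p, k


if __name__ == "__main__":
    # Start from an admissible gain (a - b*k0 < 0 when c = d = 0).
    p, k = policy_iteration_scalar(a=1.0, b=1.0, q=1.0, r=1.0, k0=2.0)
    print(p, k)
```

With c = d = 0 this reduces to the classical deterministic policy iteration for the linear-quadratic regulator, whose fixed point satisfies the scalar Riccati equation 2 a p + q - b^2 p^2 / r = 0. As in the abstract, the iteration must start from an admissible (stabilizing) gain.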


