Reinforced optimal control

11/24/2020
by   Christian Bayer, et al.
0

Least squares Monte Carlo methods are a popular numerical approximation method for solving stochastic control problems. Based on dynamic programming, their key feature is the approximation of the conditional expectation of future rewards by linear least squares regression. Hence, the choice of basis functions is crucial for the accuracy of the method. Earlier work by some of us [Belomestny, Schoenmakers, Spokoiny, Zharkynbay. Commun. Math. Sci., 18(1):109-121, 2020] proposes to reinforce the basis functions in the case of optimal stopping problems by already computed value functions for later times, thereby considerably improving the accuracy with limited additional computational cost. We extend the reinforced regression method to a general class of stochastic control problems, while considerably improving the method's efficiency, as demonstrated by substantial numerical examples as well as theoretical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/20/2021

Meshfree Approximation for Stochastic Optimal Control Problems

In this work, we study the gradient projection method for solving a clas...
research
08/07/2018

Optimal stopping via deeply boosted backward regression

In this note we propose a new approach towards solving numerically optim...
research
06/22/2020

Forward-Backward RRT: Branched Sampled FBSDEs for Stochastic Optimal Control

We propose a numerical method to solve forward-backward stochastic diffe...
research
12/14/2018

Bernstein approximation of optimal control problems

Bernstein polynomial approximation to a continuous function has a slower...
research
06/26/2019

Control variate selection for Monte Carlo integration

Monte Carlo integration with variance reduction by means of control vari...
research
02/23/2023

Sequential Hierarchical Least-Squares Programming for Prioritized Non-Linear Optimal Control

We present a sequential hierarchical least-squares programming solver wi...
research
12/17/2014

Optimal Triggering of Networked Control Systems

The problem of resource allocation of nonlinear networked control system...

Please sign up or login with your details

Forgot password? Click here to reset