Learning Stochastic Optimal Policies via Gradient Descent

06/07/2021
by   Stefano Massaroli, et al.

We systematically develop a learning-based treatment of stochastic optimal control (SOC), relying on direct optimization of parametric control policies. We propose a derivation of adjoint sensitivity results for stochastic differential equations through direct application of variational calculus. Then, given an objective function for a predetermined task specifying the desiderata for the controller, we optimize the controller's parameters via iterative gradient descent methods. In doing so, we extend the range of applicability of classical SOC techniques, which often require strict assumptions on the functional form of the system and the control. We verify the performance of the proposed approach on a continuous-time, finite-horizon portfolio optimization problem with proportional transaction costs.
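
To make the idea of optimizing a parametric policy by gradient descent concrete, here is a minimal sketch on a toy problem that is not taken from the paper. It assumes a scalar linear SDE dx = (a x + u) dt + sigma dW, a one-parameter feedback policy u = -theta x, and a quadratic cost; all names and constants (`cost_and_grad`, `train`, `a`, `sigma`, `r`) are hypothetical. The gradient is computed with forward pathwise sensitivities through an Euler-Maruyama discretization, a simpler stand-in for the adjoint sensitivity approach described in the abstract.

```python
import random


def cost_and_grad(theta, noises, a=0.5, sigma=0.3, r=0.1, x0=1.0, dt=0.02):
    """Monte Carlo estimate of the cost J(theta) and of dJ/dtheta.

    noises: list of sample paths, each a list of standard normal draws.
    Cost per path: sum of (x^2 + r u^2) dt plus terminal x_T^2, u = -theta x.
    """
    total_cost, total_grad = 0.0, 0.0
    for path in noises:
        x, s = x0, 0.0  # state and its sensitivity s = dx/dtheta
        cost, grad = 0.0, 0.0
        for z in path:
            # Running cost (1 + r theta^2) x^2 dt and its derivative in theta.
            w = 1.0 + r * theta * theta
            cost += w * x * x * dt
            grad += (2.0 * w * x * s + 2.0 * r * theta * x * x) * dt
            # Euler-Maruyama step for x, and the matching step for s.
            dw = z * dt ** 0.5
            x_next = x + (a - theta) * x * dt + sigma * dw
            s = s + ((a - theta) * s - x) * dt
            x = x_next
        # Terminal cost x_T^2 and its derivative.
        cost += x * x
        grad += 2.0 * x * s
        total_cost += cost
        total_grad += grad
    n = len(noises)
    return total_cost / n, total_grad / n


def train(steps=300, lr=0.05, n_paths=64, n_steps=50, seed=0):
    """Plain gradient descent on theta using a fixed set of noise paths."""
    rng = random.Random(seed)
    noises = [[rng.gauss(0.0, 1.0) for _ in range(n_steps)]
              for _ in range(n_paths)]
    theta, history = 0.0, []
    for _ in range(steps):
        cost, grad = cost_and_grad(theta, noises)
        history.append(cost)
        theta -= lr * grad
    return theta, history


if __name__ == "__main__":
    theta, history = train()
    print("learned gain:", theta)
    print("cost before / after:", history[0], history[-1])
```

Reusing the same noise paths at every iteration keeps the sketch deterministic and easy to check against finite differences; resampling the noise at each step would turn it into stochastic gradient descent. For policies with many parameters, such as neural networks, an adjoint (backward) computation is usually preferred because its cost does not grow with the number of parameters the way forward sensitivities do.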

research
05/21/2018

Stochastic modified equations for the asynchronous stochastic gradient descent

We propose a stochastic modified equations (SME) for modeling the asynch...
research
10/20/2021

Adaptive Gradient Descent for Optimal Control of Parabolic Equations with Random Parameters

In this paper we extend the adaptive gradient descent (AdaGrad) algorith...
research
10/17/2022

Parametric estimation of stochastic differential equations via online gradient descent

We propose an online parametric estimation method of stochastic differen...
research
11/19/2015

Stochastic modified equations and adaptive stochastic gradient algorithms

We develop the method of stochastic modified equations (SME), in which s...
research
01/14/2021

Optimal Energy Shaping via Neural Approximators

We introduce optimal energy shaping as an enhancement of classical passi...
research
01/23/2013

Learning Finite-State Controllers for Partially Observable Environments

Reactive (memoryless) policies are sufficient in completely observable M...
research
03/16/2023

Variational Principles for Mirror Descent and Mirror Langevin Dynamics

Mirror descent, introduced by Nemirovski and Yudin in the 1970s, is a pr...
