Gradient Flows for Regularized Stochastic Control Problems

06/10/2020
by   David Siska, et al.
0

This work is motivated by a desire to extend the theoretical underpinning for the convergence of stochastic gradient type algorithms widely used in the reinforcement learning community to solve control problems. This paper studies stochastic control problems regularized by the relative entropy, where the action space is the space of measures. This setting includes relaxed control problems, problems of finding Markovian controls with the control function replaced by an idealized infinitely wide neural network and can be extended to the search for causal optimal transport maps. By exploiting the Pontryagin optimality principle, we construct gradient flow for the measure-valued control process along which the cost functional is guaranteed to decrease. It is shown that under appropriate conditions, this gradient flow has an invariant measure which is the optimal control for the regularized stochastic control problem. If the problem we work with is sufficiently convex, the gradient flow converges exponentially fast.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2023

A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee

In this work, we consider the stochastic optimal control problem in cont...
research
07/12/2021

A stochastic Gauss-Newton algorithm for regularized semi-discrete optimal transport

We introduce a new second order stochastic algorithm to estimate the ent...
research
07/05/2022

Global Convergence of Successive Approximations for Non-convex Stochastic Optimal Control Problems

This paper focuses on finding approximate solutions to the stochastic op...
research
07/08/2020

On Entropic Optimization and Path Integral Control

This article is motivated by the question whether it is possible to solv...
research
10/24/2021

Deep Learning Approximation of Diffeomorphisms via Linear-Control Systems

In this paper we propose a Deep Learning architecture to approximate dif...
research
01/10/2019

Accelerated Flow for Probability distributions

This paper presents a methodology and numerical algorithms for construct...
research
06/26/2020

Semi-discrete optimization through semi-discrete optimal transport: a framework for neural architecture search

In this paper we introduce a theoretical framework for semi-discrete opt...

Please sign up or login with your details

Forgot password? Click here to reset