From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions

12/07/2019
by   Warren B. Powell, et al.
0

There are over 15 distinct communities that work in the general area of sequential decisions and information, often referred to as decisions under uncertainty or stochastic optimization. We focus on two of the most important fields: stochastic optimal control, with its roots in deterministic optimal control, and reinforcement learning, with its roots in Markov decision processes. Building on prior work, we describe a unified framework that covers all 15 different communities, and note the strong parallels with the modeling framework of stochastic optimal control. By contrast, we make the case that the modeling framework of reinforcement learning, inherited from discrete Markov decision processes, is quite limited. Our framework (and that of stochastic control) is based on the core problem of optimizing over policies. We describe four classes of policies that we claim are universal, and show that each of these two fields have, in their own way, evolved to include examples of each of these four classes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2021

Model-free Reinforcement Learning for Branching Markov Decision Processes

We study reinforcement learning for the optimal control of Branching Mar...
research
12/13/2017

Convex programming in optimal control and information theory

The main theme of this thesis is the development of computational method...
research
12/29/2017

Characterizing optimal hierarchical policy inference on graphs via non-equilibrium thermodynamics

Hierarchies are of fundamental interest in both stochastic optimal contr...
research
11/29/2022

Performance Evaluation, Optimization and Dynamic Decision in Blockchain Systems: A Recent Overview

With rapid development of blockchain technology as well as integration o...
research
02/14/2020

On State Variables, Bandit Problems and POMDPs

State variables are easily the most subtle dimension of sequential decis...
research
05/02/2018

Reinforcement Learning and Control as Probabilistic Inference: Tutorial and Review

The framework of reinforcement learning or optimal control provides a ma...
research
12/07/2017

Remarks on Bayesian Control Charts

There is a considerable amount of ongoing research on the use of Bayesia...

Please sign up or login with your details

Forgot password? Click here to reset