Characterizing optimal hierarchical policy inference on graphs via non-equilibrium thermodynamics

12/29/2017
by   Daniel McNamee, et al.
0

Hierarchies are of fundamental interest in both stochastic optimal control and biological control due to their facilitation of a range of desirable computational traits in a control algorithm and the possibility that they may form a core principle of sensorimotor and cognitive control systems. However, a theoretically justified construction of state-space hierarchies over all spatial resolutions and their evolution through a policy inference process remains elusive. Here, a formalism for deriving such normative representations of discrete Markov decision processes is introduced in the context of graphs. The resulting hierarchies correspond to a hierarchical policy inference algorithm approximating a discrete gradient flow between state-space trajectory densities generated by the prior and optimal policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2023

Bayesian Learning of Optimal Policies in Markov Decision Processes with Countably Infinite State-Space

Models of many real-life applications, such as queuing models of communi...
research
12/07/2019

From Reinforcement Learning to Optimal Control: A unified framework for sequential decisions

There are over 15 distinct communities that work in the general area of ...
research
01/30/2023

Attack Impact Evaluation for Stochastic Control Systems through Alarm Flag State Augmentation

This note addresses the problem of evaluating the impact of an attack on...
research
11/28/2019

Hierarchical model-based policy optimization: from actions to action sequences and back

We develop a normative framework for hierarchical model-based policy opt...
research
06/12/2021

Model-free Reinforcement Learning for Branching Markov Decision Processes

We study reinforcement learning for the optimal control of Branching Mar...
research
03/31/2022

Attack Impact Evaluation by Exact Convexification through State Space Augmentation

We address the attack impact evaluation problem for control system secur...
research
09/09/2019

Policy Space Identification in Configurable Environments

We study the problem of identifying the policy space of a learning agent...

Please sign up or login with your details

Forgot password? Click here to reset