Search Methods for Policy Decompositions

03/29/2022
by   Ashwin Khadke, et al.
0

Computing optimal control policies for complex dynamical systems requires approximation methods to remain computationally tractable. Several approximation methods have been developed to tackle this problem. However, these methods do not reason about the suboptimality induced in the resulting control policies due to these approximations. We introduced Policy Decomposition, an approximation method that provides a suboptimality estimate, in our earlier work. Policy decomposition proposes strategies to break an optimal control problem into lower-dimensional subproblems, whose optimal solutions are combined to build a control policy for the original system. However, the number of possible strategies to decompose a system scale quickly with the complexity of a system, posing a combinatorial challenge. In this work we investigate the use of Genetic Algorithm and Monte-Carlo Tree Search to alleviate this challenge. We identify decompositions for swing-up control of a 4 degree-of-freedom manipulator, balance control of a simplified biped, and hover control of a quadcopter.

READ FULL TEXT
research
03/03/2021

Policy Decomposition: Approximate Optimal Control with Suboptimality Estimates

Numerically computing global policies to optimal control problems for co...
research
09/15/2022

Sparsity Inducing Representations for Policy Decompositions

Policy Decomposition (PoDec) is a framework that lessens the curse of di...
research
07/23/2023

Optimal Control of Multiclass Fluid Queueing Networks: A Machine Learning Approach

We propose a machine learning approach to the optimal control of multicl...
research
06/29/2021

Limited depth bandit-based strategy for Monte Carlo planning in continuous action spaces

This paper addresses the problem of optimal control using search trees. ...
research
10/23/2017

Stability Analysis of Optimal Adaptive Control using Value Iteration with Approximation Errors

Adaptive optimal control using value iteration initiated from a stabiliz...
research
04/08/2023

Stochastic Nonlinear Control via Finite-dimensional Spectral Dynamic Embedding

Optimal control is notoriously difficult for stochastic nonlinear system...
research
01/13/2020

On the synthesis of control policies from noisy example datasets: a probabilistic approach

In this note we consider the problem of synthesizing optimal control pol...

Please sign up or login with your details

Forgot password? Click here to reset