Open Loop Execution of Tree-Search Algorithms

05/03/2018
by   Erwan Lecarpentier, et al.
0

In the context of tree-search stochastic planning algorithms where a generative model is available, we consider on-line planning algorithms building trees in order to recommend an action. We investigate the question of avoiding re-planning in subsequent decision steps by directly using sub-trees as action recommender. Firstly, we propose a method for open loop control via a new algorithm taking the decision of re-planning or not at each time step based on an analysis of the statistics of the sub-tree. Secondly, we show that the probability of selecting a suboptimal action at any depth of the tree can be upper bounded and converges towards zero. Moreover, this upper bound decays in a logarithmic way between subsequent depths. This leads to a distinction between node-wise optimality and state-wise optimality. Finally, we empirically demonstrate that our method achieves a compromise between loss of performance and computational gain.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/10/2019

Memory Bounded Open-Loop Planning in Large POMDPs using Thompson Sampling

State-of-the-art approaches to partially observable planning like POMCP ...
research
04/09/2019

Practical Open-Loop Optimistic Planning

We consider the problem of online planning in a Markov Decision Process ...
research
06/10/2020

Planning in Markov Decision Processes with Gap-Dependent Sample Complexity

We propose MDP-GapE, a new trajectory-based Monte-Carlo Tree Search algo...
research
07/11/2019

Adaptive Thompson Sampling Stacks for Memory Bounded Open-Loop Planning

We propose Stable Yet Memory Bounded Open-Loop (SYMBOL) planning, a gene...
research
04/02/2022

A UCB-based Tree Search Approach to Joint Verification-Correction Strategy for Large Scale Systems

Verification planning is a sequential decision-making problem that speci...
research
09/29/2018

Stochastic 2-D Motion Planning with a POMDP Framework

Motion planning is challenging when it comes to the case of imperfect st...
research
02/07/2021

An Analysis of Frame-skipping in Reinforcement Learning

In the practice of sequential decision making, agents are often designed...

Please sign up or login with your details

Forgot password? Click here to reset