Deep imagination is a close to optimal policy for planning in large decision trees under limited resources

04/13/2021
by   Rubén Moreno-Bote, et al.
0

Many decisions involve choosing an uncertain course of actions in deep and wide decision trees, as when we plan to visit an exotic country for vacation. In these cases, exhaustive search for the best sequence of actions is not tractable due to the large number of possibilities and limited time or computational resources available to make the decision. Therefore, planning agents need to balance breadth (exploring many actions at each level of the tree) and depth (exploring many levels in the tree) to allocate optimally their finite search capacity. We provide efficient analytical solutions and numerical analysis to the problem of allocating finite sampling capacity in one shot to large decision trees. We find that in general the optimal policy is to allocate few samples per level so that deep levels can be reached, thus favoring depth over breadth search. In contrast, in poor environments and at low capacity, it is best to broadly sample branches at the cost of not sampling deeply, although this policy is marginally better than deep allocations. Our results provide a theoretical foundation for the optimality of deep imagination for planning and show that it is a generally valid heuristic that could have evolved from the finite constraints of cognitive systems.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2020

Optimal Policy Trees

We propose an approach for learning optimal tree-based prescription poli...
research
02/15/2023

Bayesian Decision Trees via Tractable Priors and Probabilistic Context-Free Grammars

Decision Trees are some of the most popular machine learning models toda...
research
07/24/2020

MurTree: Optimal Classification Trees via Dynamic Programming and Search

Decision tree learning is a widely used approach in machine learning, fa...
research
07/11/2019

On the Optimality of Trees Generated by ID3

Since its inception in the 1980s, ID3 has become one of the most success...
research
09/18/2017

Early prediction of the duration of protests using probabilistic Latent Dirichlet Allocation and Decision Trees

Protests and agitations are an integral part of every democratic civil s...
research
10/13/2022

Fast Optimization of Weighted Sparse Decision Trees for use in Optimal Treatment Regimes and Optimal Policy Design

Sparse decision trees are one of the most common forms of interpretable ...
research
02/07/2023

Towards Understanding the Effects of Evolving the MCTS UCT Selection Policy

Monte Carlo Tree Search (MCTS) is a sampling best-first method to search...

Please sign up or login with your details

Forgot password? Click here to reset