The Second Type of Uncertainty in Monte Carlo Tree Search

05/19/2020
by Thomas M. Moerland, et al.

Monte Carlo Tree Search (MCTS) efficiently balances exploration and exploitation in tree search using uncertainty estimates derived from local visit counts. However, these counts ignore a second type of uncertainty, induced by the size of the subtree below an action. We first show that, lacking this second type of uncertainty, MCTS may completely fail on well-known sparse exploration problems from the reinforcement learning community. We then introduce a new algorithm that estimates the size of the subtree below an action and uses this estimate in the UCB formula to better direct exploration. Subsequently, we generalize these ideas by showing that loops, i.e., the repeated occurrence of (approximately) the same state in the same trace, are a special case of subtree depth variation. Experiments on a variety of tasks show that our algorithms increase sample efficiency, especially when the planning budget per timestep is small.
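
The idea of feeding a subtree-size estimate into the UCB formula can be illustrated with a small sketch. The abstract does not give the formula, so everything here is an assumption made for illustration: the names `ucb_score` and `backup_sigma` are hypothetical, `sigma` is an assumed per-action estimate in [0, 1] of how much of the subtree below the action remains unexplored (0 for a fully enumerated subtree, 1 for an untouched one), the multiplicative weighting of the exploration bonus is one plausible choice, and the count-weighted backup is likewise assumed. The full text should be consulted for the authors' actual method.

```python
import math


def ucb_score(q, n_parent, n_action, c=1.0, sigma=1.0):
    """UCB score whose exploration bonus is scaled by sigma.

    sigma is an ASSUMED subtree-uncertainty estimate in [0, 1];
    sigma=1.0 recovers the standard UCB1 bonus.
    """
    if n_action == 0:
        return math.inf  # untried actions are selected first
    bonus = c * math.sqrt(math.log(n_parent) / n_action)
    return q + sigma * bonus


def backup_sigma(child_counts, child_sigmas):
    """ASSUMED backup rule: count-weighted mean of the child sigmas."""
    total = sum(child_counts)
    if total == 0:
        return 1.0  # nothing explored yet, so the subtree is fully uncertain
    return sum(n * s for n, s in zip(child_counts, child_sigmas)) / total


# Action A: higher mean return, but its subtree is fully enumerated (sigma=0).
score_a = ucb_score(q=0.5, n_parent=20, n_action=10, sigma=0.0)
# Action B: lower mean return, subtree still entirely unexplored (sigma=1).
score_b = ucb_score(q=0.4, n_parent=20, n_action=10, sigma=1.0)
best = "A" if score_a > score_b else "B"
```

With equal visit counts, plain UCB1 would prefer action A because of its higher mean return. Under the assumed weighting, A's bonus vanishes once its subtree is exhausted, so the search is directed towards B.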

research
05/23/2018

Monte Carlo Tree Search for Asymmetric Trees

We present an extension of Monte Carlo Tree Search (MCTS) that strongly ...
research
07/01/2020

Convex Regularization in Monte-Carlo Tree Search

Monte-Carlo planning and Reinforcement Learning (RL) are essential to se...
research
05/14/2012

Efficient Bayes-Adaptive Reinforcement Learning using Sample-Based Search

Bayesian model-based reinforcement learning is a formally elegant approa...
research
12/03/2013

Combining Simulated Annealing and Monte Carlo Tree Search for Expression Simplification

In many applications of computer algebra large expressions must be simpl...
research
05/11/2013

Geiringer Theorems: From Population Genetics to Computational Intelligence, Memory Evolutive Systems and Hebbian Learning

The classical Geiringer theorem addresses the limiting frequency of occu...
research
02/15/2020

Legion: Best-First Concolic Testing

Legion is a grey-box concolic tool that aims to balance the complementar...
research
11/22/2022

Monte Carlo Forest Search: UNSAT Solver Synthesis via Reinforcement learning

We introduce Monte Carlo Forest Search (MCFS), an offline algorithm for ...
