Adapting Improved Upper Confidence Bounds for Monte-Carlo Tree Search

05/11/2015
by   Yun-Ching Liu, et al.
0

The UCT algorithm, which combines the UCB algorithm and Monte-Carlo Tree Search (MCTS), is currently the most widely used variant of MCTS. Recently, a number of investigations into applying other bandit algorithms to MCTS have produced interesting results. In this research, we will investigate the possibility of combining the improved UCB algorithm, proposed by Auer et al. (2010), with MCTS. However, various characteristics and properties of the improved UCB algorithm may not be ideal for a direct application to MCTS. Therefore, some modifications were made to the improved UCB algorithm, making it more suitable for the task of game tree search. The Mi-UCT algorithm is the application of the modified UCB algorithm applied to trees. The performance of Mi-UCT is demonstrated on the games of 9× 9 Go and 9× 9 NoGo, and has shown to outperform the plain UCT algorithm when only a small number of playouts are given, and rougly on the same level when more playouts are available.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/20/2020

Monte-Carlo Graph Search for AlphaZero

The AlphaZero algorithm has been successfully applied in a range of disc...
research
06/08/2022

Combining Monte-Carlo Tree Search with Proof-Number Search

Proof-Number Search (PNS) and Monte-Carlo Tree Search (MCTS) have been s...
research
05/23/2018

Monte Carlo Tree Search for Asymmetric Trees

We present an extension of Monte Carlo Tree Search (MCTS) that strongly ...
research
05/27/2020

ProTuner: Tuning Programs with Monte Carlo Tree Search

We explore applying the Monte Carlo Tree Search (MCTS) algorithm in a no...
research
12/10/2022

Lookahead Pathology in Monte-Carlo Tree Search

Monte-Carlo Tree Search (MCTS) is an adversarial search paradigm that fi...
research
06/15/2020

On Effective Parallelization of Monte Carlo Tree Search

Despite its groundbreaking success in Go and computer games, Monte Carlo...
research
06/08/2020

POLY-HOOT: Monte-Carlo Planning in Continuous Space MDPs with Non-Asymptotic Analysis

Monte-Carlo planning, as exemplified by Monte-Carlo Tree Search (MCTS), ...

Please sign up or login with your details

Forgot password? Click here to reset