Understanding Sampling Style Adversarial Search Methods

03/15/2012
by   Raghuram Ramanujan, et al.
0

UCT has recently emerged as an exciting new adversarial reasoning technique based on cleverly balancing exploration and exploitation in a Monte-Carlo sampling setting. It has been particularly successful in the game of Go but the reasons for its success are not well understood and attempts to replicate its success in other domains such as Chess have failed. We provide an in-depth analysis of the potential of UCT in domain-independent settings, in cases where heuristic values are available, and the effect of enhancing random playouts to more informed playouts between two weak minimax players. To provide further insights, we develop synthetic game tree instances and discuss interesting properties of UCT, both empirically and analytically.

READ FULL TEXT
research
06/02/2014

Monte Carlo Tree Search with Heuristic Evaluations using Implicit Minimax Backups

Monte Carlo Tree Search (MCTS) has improved the performance of game engi...
research
07/12/2018

Monte Carlo Methods for the Game Kingdomino

Kingdomino is introduced as an interesting game for studying game playin...
research
12/10/2022

Lookahead Pathology in Monte-Carlo Tree Search

Monte-Carlo Tree Search (MCTS) is an adversarial search paradigm that fi...
research
12/17/2021

On the Evolution of the MCTS Upper Confidence Bounds for Trees by Means of Evolutionary Algorithms in the Game of Carcassonne

Monte Carlo Tree Search (MCTS) is a sampling best-first method to search...
research
05/16/2023

Scale-Adaptive Balancing of Exploration and Exploitation in Classical Planning

Balancing exploration and exploitation has been an important problem in ...
research
10/20/2011

A Version of Geiringer-like Theorem for Decision Making in the Environments with Randomness and Incomplete Information

Purpose: In recent years Monte-Carlo sampling methods, such as Monte Car...

Please sign up or login with your details

Forgot password? Click here to reset