Static and Dynamic Values of Computation in MCTS

02/11/2020
by Eren Sezener, et al.

Monte-Carlo Tree Search (MCTS) is one of the most widely used methods for planning, and has powered many recent advances in artificial intelligence. In MCTS, one typically performs computations (i.e., simulations) to collect statistics about the possible future consequences of actions, and then chooses accordingly. Many popular MCTS methods, such as UCT and its variants, decide which computations to perform by trading off exploration and exploitation. In this work, we take a more direct approach and explicitly quantify the value of a computation based on its expected impact on the quality of the action eventually chosen. Our approach goes beyond the "myopic" limitations of existing computation-value-based methods in two senses: (I) we are able to account for the impact of non-immediate (i.e., future) computations (II) on non-immediate actions. We show that policies that greedily optimize computation values are optimal under certain assumptions, and we obtain results that are competitive with the state of the art.
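
To make the notion of a computation's value concrete, the sketch below computes a myopic, one-step value for the simplest possible setting: a flat choice among actions with independent Gaussian beliefs. This is the standard knowledge-gradient-style quantity, shown only as an illustration of the "myopic" baseline the abstract refers to; it is not the method proposed in the paper. The function name `myopic_voc`, the Gaussian belief model, and the known observation noise are all assumptions made for this example.

```python
import math


def normal_pdf(z):
    """Standard normal density."""
    return math.exp(-0.5 * z * z) / math.sqrt(2.0 * math.pi)


def normal_cdf(z):
    """Standard normal cumulative distribution function."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def myopic_voc(means, stds, noise_std, i):
    """Expected increase in the best posterior mean after ONE more noisy
    sample of action i (hypothetical helper; assumes independent Gaussian
    beliefs, known noise, and at least two actions)."""
    var = stds[i] ** 2
    # Standard deviation of the change in action i's posterior mean
    # that a single new sample would cause.
    s = var / math.sqrt(var + noise_std ** 2)
    if s == 0.0:
        return 0.0  # nothing left to learn about this action
    best_other = max(m for j, m in enumerate(means) if j != i)
    z = -abs(means[i] - best_other) / s
    # E[max(mean_i + s*Z, best_other)] - max(mean_i, best_other), Z ~ N(0, 1)
    return s * (z * normal_cdf(z) + normal_pdf(z))


def greedy_computation(means, stds, noise_std):
    """Pick the action whose next sample has the highest myopic value."""
    values = [myopic_voc(means, stds, noise_std, i) for i in range(len(means))]
    return max(range(len(means)), key=values.__getitem__)
```

A greedy policy in this toy setting would call `greedy_computation` before each simulation and sample the returned action. The value is myopic in both senses named above: it considers only the next computation, and only its effect on the immediate action choice.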


research
11/07/2018

Computing the Value of Computation for Planning

An intelligent agent performs actions in order to achieve its goals. Suc...
research
03/25/2021

Active Tree Search in Large POMDPs

Model-based planning and prospection are widely studied in both cognitiv...
research
01/23/2013

Model-Based Bayesian Exploration

Reinforcement learning systems are often concerned with balancing explor...
research
02/11/2022

A Unified Perspective on Value Backup and Exploration in Monte-Carlo Tree Search

Monte-Carlo Tree Search (MCTS) is a class of methods for solving complex...
research
02/13/2018

Learning to Search with MCTSnets

Planning problems are among the most important and well-studied problems...
research
03/22/2018

DOP: Deep Optimistic Planning with Approximate Value Function Evaluation

Research on reinforcement learning has demonstrated promising results in...
research
03/13/2018

Fractal AI: A fragile theory of intelligence

Fractal AI is a theory for general artificial intelligence. It allows to...
