Single-Agent Policy Tree Search With Guarantees

11/27/2018
by   Laurent Orseau, et al.
0

We introduce two novel tree search algorithms that use a policy to guide search. The first algorithm is a best-first enumeration that uses a cost function that allows us to prove an upper bound on the number of nodes to be expanded before reaching a goal state. We show that this best-first algorithm is particularly well suited for `needle-in-a-haystack' problems. The second algorithm is based on sampling and we prove an upper bound on the expected number of nodes it expands before reaching a set of goal states. We show that this algorithm is better suited for problems where many paths lead to a goal. We validate these tree search algorithms on 1,000 computer-generated levels of Sokoban, where the policy used to guide the search comes from a neural network trained using A3C. Our results show that the policy tree search algorithms we introduce are competitive with a state-of-the-art domain-independent planner that uses heuristic search.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2021

Policy-Guided Heuristic Search with Guarantees

The use of a policy and a heuristic function for guiding search can be q...
research
05/22/2020

Single-Agent Optimization Through Policy Iteration Using Monte-Carlo Tree Search

The combination of Monte-Carlo Tree Search (MCTS) and deep reinforcement...
research
05/26/2023

Levin Tree Search with Context Models

Levin Tree Search (LTS) is a search algorithm that makes use of a policy...
research
11/28/2022

Learning to design without prior data: Discovering generalizable design strategies using deep learning and tree search

Building an AI agent that can design on its own has been a goal since th...
research
07/30/2019

Iterative Budgeted Exponential Search

We tackle two long-standing problems related to re-expansions in heurist...
research
02/07/2023

Towards Understanding the Effects of Evolving the MCTS UCT Selection Policy

Monte Carlo Tree Search (MCTS) is a sampling best-first method to search...
research
02/08/2021

A* Search Without Expansions: Learning Heuristic Functions with Deep Q-Networks

A* search is an informed search algorithm that uses a heuristic function...

Please sign up or login with your details

Forgot password? Click here to reset