Generalized Nested Rollout Policy Adaptation

03/22/2020
by Tristan Cazenave, et al.

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single-player games. In this paper we propose to generalize NRPA with a temperature and a bias, and to analyze the resulting algorithms theoretically. The generalized algorithm is named GNRPA. Experiments show that it improves on NRPA in two application domains: SameGame and the Traveling Salesman Problem with Time Windows.
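
To make the idea of "a temperature and a bias" concrete, here is a minimal sketch of how a rollout policy might sample a move. The abstract does not state the formula, so the softmax form used here (each move scored as weight / temperature + bias) is an assumption, and the names `move_probabilities` and `sample_move` are hypothetical, not taken from the paper.

```python
import math
import random


def move_probabilities(weights, biases, temperature=1.0):
    """Return a softmax distribution over candidate moves.

    Assumed scoring (not confirmed by the abstract):
        score = weight / temperature + bias
    With temperature 1 and all biases 0 this is a plain softmax over weights.
    """
    scores = [w / temperature + b for w, b in zip(weights, biases)]
    top = max(scores)  # subtract the max for numerical stability
    exps = [math.exp(s - top) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def sample_move(weights, biases, temperature=1.0, rng=random):
    """Draw one move index according to the distribution above."""
    probs = move_probabilities(weights, biases, temperature)
    return rng.choices(range(len(probs)), weights=probs, k=1)[0]
```

Under this assumed form, a higher temperature flattens the distribution, while a positive bias raises a move's probability before any weights have been learned, for example to inject a domain heuristic.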

research
01/10/2021

Stabilized Nested Rollout Policy Adaptation

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorith...
research
07/04/2022

Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Mo...
research
11/12/2021

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

In this paper we present an extension of the Nested Rollout Policy Adapt...
research
08/23/2012

Monte Carlo Search Algorithm Discovery for One Player Games

Much current research in AI and games is being devoted to Monte Carlo se...
research
02/28/2022

Monkey Business: Reinforcement learning meets neighborhood search for Virtual Network Embedding

In this article, we consider the Virtual Network Embedding (VNE) problem...
research
10/01/2020

Bayesian Policy Search for Stochastic Domains

AI planning can be cast as inference in probabilistic models, and probab...
research
05/09/2022

Nested Zero Inflated Generalized Poisson Regression for FIFA World Cup 2022

This article is devoted to the forecast of the FIFA World Cup 2022 via n...
