DeepAI AI Chat
Log In Sign Up

Generalized Nested Rollout Policy Adaptation

by   Tristan Cazenave, et al.

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to generalize NRPA with a temperature and a bias and to analyze theoretically the algorithms. The generalized algorithm is named GNRPA. Experiments show it improves on NRPA for different application domains: SameGame and the Traveling Salesman Problem with Time Windows.


page 1

page 2

page 3

page 4


Stabilized Nested Rollout Policy Adaptation

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorith...

Refutation of Spectral Graph Theory Conjectures with Monte Carlo Search

We demonstrate how Monte Carlo Search (MCS) algorithms, namely Nested Mo...

Generalized Nested Rollout Policy Adaptation with Dynamic Bias for Vehicle Routing

In this paper we present an extension of the Nested Rollout Policy Adapt...

Monte Carlo Search Algorithm Discovery for One Player Games

Much current research in AI and games is being devoted to Monte Carlo se...

Monkey Business: Reinforcement learning meets neighborhood search for Virtual Network Embedding

In this article, we consider the Virtual Network Embedding (VNE) problem...

Bayesian Policy Search for Stochastic Domains

AI planning can be cast as inference in probabilistic models, and probab...

Nested Zero Inflated Generalized Poisson Regression for FIFA World Cup 2022

This article is devoted to the forecast of the FIFA World Cup 2022 via n...