Stabilized Nested Rollout Policy Adaptation

01/10/2021
by   Tristan Cazenave, et al.
0

Nested Rollout Policy Adaptation (NRPA) is a Monte Carlo search algorithm for single player games. In this paper we propose to modify NRPA in order to improve the stability of the algorithm. Experiments show it improves the algorithm for different application domains: SameGame, Traveling Salesman with Time Windows and Expression Discovery.

READ FULL TEXT

Please sign up or login with your details

Forgot password? Click here to reset