Accelerating Empowerment Computation with UCT Tree Search

03/27/2018
by   Christoph Salge, et al.
0

Models of intrinsic motivation present an important means to produce sensible behaviour in the absence of extrinsic rewards. Applications in video games are varied, and range from intrinsically motivated general game-playing agents to non-player characters such as companions and enemies. The information-theoretic quantity of Empowerment is a particularly promising candidate motivation to produce believable, generic and robust behaviour. However, while it can be used in the absence of external reward functions that would need to be crafted and learned, empowerment is computationally expensive. In this paper, we propose a modified UCT tree search method to mitigate empowerment's computational complexity in discrete and deterministic scenarios. We demonstrate how to modify a Monte-Carlo Search Tree with UCT to realise empowerment maximisation, and discuss three additional modifications that facilitate better sampling. We evaluate the approach both quantitatively, by analysing how close our approach gets to the baseline of exhaustive empowerment computation with varying amounts of computational resources, and qualitatively, by analysing the resulting behaviour in a Minecraft-like scenario.

READ FULL TEXT

page 5

page 7

research
10/29/2019

Multiplayer AlphaZero

The AlphaZero algorithm has achieved superhuman performance in two-playe...
research
03/17/2020

Enhancing the Monte Carlo Tree Search Algorithm for Video Game Testing

In this paper, we study the effects of several Monte Carlo Tree Search (...
research
06/04/2018

New And Surprising Ways to Be Mean. Adversarial NPCs with Coupled Empowerment Minimisation

Creating Non-Player Characters (NPCs) that can react robustly to unfores...
research
03/08/2021

Monte Carlo Tree Search: A Review of Recent Modifications and Applications

Monte Carlo Tree Search (MCTS) is a powerful approach to designing game-...
research
01/14/2019

Ordinal Monte Carlo Tree Search

In many problem settings, most notably in game playing, an agent receive...
research
06/03/2014

Changing the Environment Based on Empowerment as Intrinsic Motivation

One aspect of intelligence is the ability to restructure your own enviro...
research
05/31/2019

Ordinal Bucketing for Game Trees using Dynamic Quantile Approximation

In this paper, we present a simple and cheap ordinal bucketing algorithm...

Please sign up or login with your details

Forgot password? Click here to reset