Using Graph Convolutional Networks and TD(λ) to play the game of Risk

09/10/2020
by   Jamie Carr, et al.
0

Risk is 6 player game with significant randomness and a large game-tree complexity which poses a challenge to creating an agent to play the game effectively. Previous AIs focus on creating high-level handcrafted features determine agent decision making. In this project, I create D.A.D, A Risk agent using temporal difference reinforcement learning to train a Deep Neural Network including a Graph Convolutional Network to evaluate player positions. This is used in a game-tree to select optimal moves. This allows minimal handcrafting of knowledge into the AI, assuring input features are as low-level as possible to allow the network to extract useful and sophisticated features itself, even with the network starting from a random initialisation. I also tackle the issue of non-determinism in Risk by introducing a new method of interpreting attack moves necessary for the search. The result is an AI which wins 35 versus 5 of best inbuilt AIs in Lux Delux, a Risk variant.

READ FULL TEXT
research
08/30/2018

Application of Self-Play Reinforcement Learning to a Four-Player Game of Imperfect Information

We introduce a new virtual environment for simulating a card game known ...
research
06/30/2022

Mastering the Game of Stratego with Model-Free Multiagent Reinforcement Learning

We introduce DeepNash, an autonomous agent capable of learning to play t...
research
09/11/2017

Combining Strategic Learning and Tactical Search in Real-Time Strategy Games

A commonly used technique for managing AI complexity in real-time strate...
research
07/02/2019

Playing Go without Game Tree Search Using Convolutional Neural Networks

The game of Go has a long history in East Asian countries, but the field...
research
05/23/2017

Thinking Fast and Slow with Deep Learning and Tree Search

Sequential decision making problems, such as structured prediction, robo...
research
02/17/2018

A Deep Q-Learning Agent for the L-Game with Variable Batch Training

We employ the Deep Q-Learning algorithm with Experience Replay to train ...
research
05/16/2023

RAMario: Experimental Approach to Reptile Algorithm – Reinforcement Learning for Mario

This research paper presents an experimental approach to using the Repti...

Please sign up or login with your details

Forgot password? Click here to reset