Train on Small, Play the Large: Scaling Up Board Games with AlphaZero and GNN

07/18/2021
by Shai Ben-Assayag, et al.

Playing board games is considered a major challenge for both humans and AI researchers. Because some complicated board games are quite hard to learn, humans usually begin by playing on smaller boards and incrementally advance to master strategies for larger boards. Most neural network frameworks currently tasked with playing board games neither perform such incremental learning nor can they scale up automatically. In this work, we look at the board as a graph and integrate a graph neural network architecture into the AlphaZero framework, along with some other innovative improvements. Our ScalableAlphaZero is capable of learning to play incrementally on small boards and advancing to play on large ones. Our model can be trained quickly to play different challenging board games on multiple board sizes, without using any domain knowledge. We demonstrate the effectiveness of ScalableAlphaZero and show, for example, that after training for only three days on small Othello boards, it can defeat an AlphaZero model on a large board, even though that model was trained on the large board for 30 days.
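To make the "board as a graph" idea concrete, here is a minimal sketch in plain Python. It is not the authors' implementation: the helper names (`board_to_graph`, `message_passing_layer`), the 8-neighbor connectivity, the scalar node features, and the mean aggregation are all assumptions chosen for illustration. The point it shows is that a message-passing layer has weights that do not depend on the number of squares, so the same parameters can be applied to boards of different sizes.

```python
def board_to_graph(n):
    """Build adjacency lists for an n x n board.

    Each square is a node (index r * n + c). Edges connect the up to 8
    surrounding squares; this connectivity is an illustrative assumption.
    """
    neighbors = {}
    for r in range(n):
        for c in range(n):
            adj = []
            for dr in (-1, 0, 1):
                for dc in (-1, 0, 1):
                    if (dr, dc) == (0, 0):
                        continue
                    rr, cc = r + dr, c + dc
                    if 0 <= rr < n and 0 <= cc < n:
                        adj.append(rr * n + cc)
            neighbors[r * n + c] = adj
    return neighbors


def message_passing_layer(features, neighbors, w_self, w_neigh):
    """One round of message passing with scalar node features.

    Each node combines its own feature with the mean of its neighbors'
    features, followed by a ReLU. The two weights are shared by every
    node, so nothing here depends on the board size.
    """
    out = []
    for node, h in enumerate(features):
        adj = neighbors[node]
        mean_neigh = sum(features[j] for j in adj) / len(adj) if adj else 0.0
        out.append(max(0.0, w_self * h + w_neigh * mean_neigh))
    return out


# The same (hypothetical) weights are reused on a small and a large board.
w_self, w_neigh = 0.5, 0.5
for size in (6, 8):
    graph = board_to_graph(size)
    feats = [1.0] * (size * size)  # placeholder features, one per square
    hidden = message_passing_layer(feats, graph, w_self, w_neigh)
    print(size, len(hidden))
```

A convolutional or fully connected policy head, by contrast, is usually tied to a fixed input shape, which is one reason a graph formulation is a natural fit for training on small boards and playing on large ones.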

research
09/01/2022

A Technique to Create Weaker Abstract Board Game Agents via Reinforcement Learning

Board games, with the exception of solo games, need at least one other p...
research
11/17/2017

Learning to Play Othello with Deep Neural Networks

Achieving superhuman playing level by AlphaGo corroborated the capabilit...
research
04/26/2020

Warm-Start AlphaZero Self-Play Search Enhancements

Recently, AlphaZero has achieved landmark results in deep reinforcement ...
research
01/27/2020

Polygames: Improved Zero Learning

Since DeepMind's AlphaZero, Zero learning quickly became the state-of-th...
research
06/05/2014

Systematic N-tuple Networks for Position Evaluation: Exceeding 90% in the Othello League

N-tuple networks have been successfully used as position evaluation func...
research
09/11/2018

SAI, a Sensible Artificial Intelligence that plays Go

We propose a multiple-komi modification of the AlphaGo Zero/Leela Zero p...
research
03/29/2023

Concise QBF Encodings for Games on a Grid (extended version)

Encoding 2-player games in QBF correctly and efficiently is challenging ...
