Aligning Superhuman AI and Human Behavior: Chess as a Model System

06/02/2020
by   Reid McIlroy-Young, et al.
10

As artificial intelligence becomes increasingly intelligent—in some cases, achieving superhuman performance—there is growing potential for humans to learn from and collaborate with algorithms. However, the ways in which AI systems approach problems are often different from the ways people do, and thus may be uninterpretable and hard to learn from. A crucial step in bridging this gap between human and artificial intelligence is modeling the granular actions that constitute human behavior, rather than simply matching aggregate human performance. We pursue this goal in a model system with a long history in artificial intelligence: chess. The aggregate performance of a chess player unfolds as they make decisions over the course of a game. The hundreds of millions of games played online by players at every skill level form a rich source of data in which these decisions, and their exact context, are recorded in minute detail. Applying existing chess engines to this data, including an open-source implementation of AlphaZero, we find that they do not predict human moves well. We develop and introduce Maia, a customized version of Alpha-Zero trained on human chess games, that predicts human moves at a much higher accuracy than existing engines, and can achieve maximum accuracy when predicting decisions made by players at a specific skill level in a tuneable way. For a dual task of predicting whether a human will make a large mistake on the next move, we develop a deep neural network that significantly outperforms competitive baselines. Taken together, our results suggest that there is substantial promise in designing artificial intelligence systems with human collaboration in mind by first accurately modeling granular human decision-making.

READ FULL TEXT
research
09/27/2019

Beating humans in a penny-matching game by leveraging cognitive hierarchy theory and Bayesian learning

It is a long-standing goal of artificial intelligence (AI) to be superio...
research
08/23/2020

Learning Personalized Models of Human Behavior in Chess

Even when machine learning systems surpass human ability in a domain, th...
research
11/23/2017

Improvised Comedy as a Turing Test

The best improvisational theatre actors can make any scene partner, of a...
research
06/15/2016

Assessing Human Error Against a Benchmark of Perfection

An increasing number of domains are providing us with detailed trace dat...
research
11/19/2015

Better Computer Go Player with Neural Network and Long-term Prediction

Competing with top human players in the ancient game of Go has been a lo...
research
03/09/2018

Institutional Metaphors for Designing Large-Scale Distributed AI versus AI Techniques for Running Institutions

Artificial Intelligence (AI) started out with an ambition to reproduce t...
research
11/09/2018

Analysis of Fleet Modularity in an Artificial Intelligence-Based Attacker-Defender Game

Because combat environments change over time and technology upgrades are...

Please sign up or login with your details

Forgot password? Click here to reset