Assessing Human Error Against a Benchmark of Perfection

06/15/2016
by Ashton Anderson, et al.
An increasing number of domains are providing us with detailed trace data on human decisions in settings where we can evaluate the quality of these decisions via an algorithm. Motivated by this development, an emerging line of work has begun to consider whether we can characterize and predict the kinds of decisions where people are likely to make errors. To investigate what a general framework for human error prediction might look like, we focus on a model system with a rich history in the behavioral sciences: the decisions made by chess players as they select moves in a game. We carry out our analysis at a large scale, employing datasets with several million recorded games, and using chess tablebases to acquire a form of ground truth for a subset of chess positions that have been completely solved by computers but remain challenging even for the best players in the world. We organize our analysis around three categories of features that we argue are present in most settings where the analysis of human error is applicable: the skill of the decision-maker, the time available to make the decision, and the inherent difficulty of the decision. We identify rich structure in all three of these categories of features, and find strong evidence that in our domain, features describing the inherent difficulty of an instance are significantly more powerful than features based on skill or time.
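To make the role of tablebase ground truth concrete, here is a minimal sketch of how a single decision could be labeled. It assumes, as an illustration rather than as the authors' stated definition, that a move counts as an error when it worsens the game-theoretic outcome for the player who made it. The names `Outcome`, `is_error`, and `error_rate` are hypothetical, and the outcomes would in practice come from a tablebase lookup that is not shown here.

```python
from enum import IntEnum


class Outcome(IntEnum):
    """Game-theoretic result from the mover's perspective (assumed encoding)."""
    LOSS = -1
    DRAW = 0
    WIN = 1


def is_error(best_outcome: Outcome, outcome_after_move: Outcome) -> bool:
    """Return True if the chosen move gives up value.

    best_outcome: result under perfect play before the move.
    outcome_after_move: result under perfect play after the move,
    still expressed from the mover's perspective.
    """
    return outcome_after_move < best_outcome


def error_rate(decisions) -> float:
    """Fraction of (best_outcome, outcome_after_move) pairs that are errors."""
    labels = [is_error(best, after) for best, after in decisions]
    return sum(labels) / len(labels) if labels else 0.0
```

Under this assumed labeling, error rates can be computed separately for groups of decisions split by player skill, time available, or a difficulty measure, which are the three feature categories the analysis compares.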

research
06/02/2020

Aligning Superhuman AI and Human Behavior: Chess as a Model System

As artificial intelligence becomes increasingly intelligent—in some case...
research
04/30/2021

Human strategic decision making in parametrized games

Many real-world games contain parameters which can affect payoffs, actio...
research
01/26/2022

Speed, Quality, and the Optimal Timing of Complex Decisions: Field Evidence

This paper presents an empirical investigation of the relation between d...
research
08/02/2022

Detecting Individual Decision-Making Style: Exploring Behavioral Stylometry in Chess

The advent of machine learning models that surpass human decision-making...
research
05/26/2023

Temporal Evolution of Risk Behavior in a Disease Spread Simulation

Human behavior is a dynamic process that evolves with experience. Unders...
research
08/23/2020

Learning Personalized Models of Human Behavior in Chess

Even when machine learning systems surpass human ability in a domain, th...
research
12/17/2020

Predicting Decisions in Language Based Persuasion Games

Sender-receiver interactions, and specifically persuasion games, are wid...
