Gamifying optimization: a Wasserstein distance-based analysis of human search

12/12/2021
by   Antonio Candelieri, et al.
0

The main objective of this paper is to outline a theoretical framework to characterise humans' decision-making strategies under uncertainty, in particular active learning in a black-box optimization task and trading-off between information gathering (exploration) and reward seeking (exploitation). Humans' decisions making according to these two objectives can be modelled in terms of Pareto rationality. If a decision set contains a Pareto efficient strategy, a rational decision maker should always select the dominant strategy over its dominated alternatives. A distance from the Pareto frontier determines whether a choice is Pareto rational. To collect data about humans' strategies we have used a gaming application that shows the game field, with previous decisions and observations, as well as the score obtained. The key element in this paper is the representation of behavioural patterns of human learners as a discrete probability distribution. This maps the problem of the characterization of humans' behaviour into a space whose elements are probability distributions structured by a distance between histograms, namely the Wasserstein distance (WST). The distributional analysis gives new insights about human search strategies and their deviations from Pareto rationality. Since the uncertainty is one of the two objectives defining the Pareto frontier, the analysis has been performed for three different uncertainty quantification measures to identify which better explains the Pareto compliant behavioural patterns. Beside the analysis of individual patterns WST has also enabled a global analysis computing the barycenters and WST k-means clustering. A further analysis has been performed by a decision tree to relate non-Paretian behaviour, characterized by exasperated exploitation, to the dynamics of the evolution of the reward seeking process.

READ FULL TEXT

page 21

page 30

page 31

research
05/15/2023

Mastering the exploration-exploitation trade-off in Bayesian Optimization

Gaussian Process based Bayesian Optimization is a well-known sample effi...
research
12/07/2021

Multi-Task Learning on Networks

The multi-task learning (MTL) paradigm can be traced back to an early pa...
research
08/17/2022

Ban The Box? Information, Incentives, and Statistical Discrimination

"Banning the Box" refers to a policy campaign aimed at prohibiting emplo...
research
03/09/2020

Modelling Human Active Search in Optimizing Black-box Functions

Modelling human function learning has been the subject of in-tense resea...
research
02/27/2022

Pareto-Rational Verification

We study the rational verification problem which consists in verifying t...
research
01/09/2023

Do Performance Aspirations Matter for Guiding Software Configuration Tuning?

Configurable software systems can be tuned for better performance. Lever...

Please sign up or login with your details

Forgot password? Click here to reset