Should I tear down this wall? Optimizing social metrics by evaluating novel actions

04/16/2020
by   János Kramár, et al.
0

One of the fundamental challenges of governance is deciding when and how to intervene in multi-agent systems in order to impact group-wide metrics of success. This is particularly challenging when proposed interventions are novel and expensive. For example, one may wish to modify a building's layout to improve the efficiency of its escape route. Evaluating such interventions would generally require access to an elaborate simulator, which must be constructed ad-hoc for each environment, and can be prohibitively costly or inaccurate. Here we examine a simple alternative: Optimize By Observational Extrapolation (OBOE). The idea is to use observed behavioural trajectories, without any interventions, to learn predictive models mapping environment states to individual agent outcomes, and then use these to evaluate and select changes. We evaluate OBOE in socially complex gridworld environments and consider novel physical interventions that our models were not trained on. We show that neural network models trained to predict agent returns on baseline environments are effective at selecting among the interventions. Thus, OBOE can provide guidance for challenging questions like: "which wall should I tear down in order to minimize the Gini index of this group?"

READ FULL TEXT
research
09/22/2022

Developing, Evaluating and Scaling Learning Agents in Multi-Agent Environments

The Game Theory Multi-Agent team at DeepMind studies several aspects...
research
07/15/2019

Defining mediation effects for multiple mediators using the concept of the target randomized trial

Causal mediation approaches have been primarily developed for the goal o...
research
02/12/2020

Resolving Spurious Correlations in Causal Models of Environments via Interventions

Causal models could increase interpretability, robustness to distributio...
research
10/11/2022

A General Learning Framework for Open Ad Hoc Teamwork Using Graph-based Policy Learning

Open ad hoc teamwork is the problem of training a single agent to effici...
research
12/16/2021

On Optimizing Interventions in Shared Autonomy

Shared autonomy refers to approaches for enabling an autonomous agent to...

Please sign up or login with your details

Forgot password? Click here to reset