Policy learning for many outcomes of interest: Combining optimal policy trees with multi-objective Bayesian optimisation

12/13/2022
by Patrick Rehill, et al.

Methods for learning optimal policies use causal machine learning models to create human-interpretable rules for allocating policy interventions. However, in realistic policy-making contexts, decision-makers often care about trade-offs between outcomes, not just single-mindedly maximising utility for one outcome. This paper proposes an approach termed Multi-Objective Policy Learning (MOPoL), which combines optimal decision trees for policy learning with multi-objective Bayesian optimisation to explore the trade-off between multiple outcomes. It does this by building a Pareto frontier of non-dominated models across different hyperparameter settings. The key insight is that a low-cost surrogate function can be an accurate proxy for the very computationally costly optimal tree in terms of expected regret. This surrogate can be fit many times with different hyperparameter values to proxy the performance of the optimal model. The method is applied to a real-world case study of conditional cash transfers in Morocco, where hybrid (partially optimal, partially greedy) policy trees perform well as a surrogate for optimal trees while being computationally cheap enough to feasibly fit a Pareto frontier.
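
To make the idea of a Pareto frontier of non-dominated models concrete, the following is a minimal sketch in Python. It is not the paper's implementation: the function name `pareto_front` and the candidate values are hypothetical, and it assumes every objective is to be maximised. Each tuple stands in for the estimated value of one candidate policy tree on two outcomes.

```python
from typing import List, Sequence


def pareto_front(points: Sequence[Sequence[float]]) -> List[int]:
    """Return indices of non-dominated points, assuming all objectives are maximised."""
    front = []
    for i, p in enumerate(points):
        dominated = False
        for j, q in enumerate(points):
            if j == i:
                continue
            # q dominates p if it is at least as good on every objective
            # and strictly better on at least one.
            at_least_as_good = all(qk >= pk for qk, pk in zip(q, p))
            strictly_better = any(qk > pk for qk, pk in zip(q, p))
            if at_least_as_good and strictly_better:
                dominated = True
                break
        if not dominated:
            front.append(i)
    return front


# Hypothetical (outcome A, outcome B) values for four candidate models,
# e.g. surrogate trees fit under different hyperparameter settings.
candidates = [(0.10, 0.50), (0.30, 0.40), (0.20, 0.30), (0.40, 0.10)]
front = pareto_front(candidates)
print(front)
```

In this illustration the third candidate is dominated by the second, which is better on both outcomes, so it drops out. The remaining candidates each represent a different trade-off that a decision-maker could choose between.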


