Interpretable Off-Policy Learning via Hyperbox Search

03/04/2022
by   Daniel Tschernutter, et al.
0

Personalized treatment decisions have become an integral part of modern medicine. Thereby, the aim is to make treatment decisions based on individual patient characteristics. Numerous methods have been developed for learning such policies from observational data that achieve the best outcome across a certain policy class. Yet these methods are rarely interpretable. However, interpretability is often a prerequisite for policy learning in clinical practice. In this paper, we propose an algorithm for interpretable off-policy learning via hyperbox search. In particular, our policies can be represented in disjunctive normal form (i.e., OR-of-ANDs) and are thus intelligible. We prove a universal approximation theorem that shows that our policy class is flexible enough to approximate any measurable function arbitrarily well. For optimization, we develop a tailored column generation procedure within a branch-and-bound framework. Using a simulation study, we demonstrate that our algorithm outperforms state-of-the-art methods from interpretable off-policy learning in terms of regret. Using real-word clinical data, we perform a user study with actual clinical experts, who rate our policies as highly interpretable.

READ FULL TEXT

page 20

page 26

research
10/10/2018

Bayesian Nonparametric Policy Search with Application to Periodontal Recall Intervals

Tooth loss from periodontal disease is a major public health burden in t...
research
07/02/2020

Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score

Randomized controlled trials typically analyze the effectiveness of trea...
research
10/11/2021

CAPITAL: Optimal Subgroup Identification via Constrained Policy Tree Search

Personalized medicine, a paradigm of medicine tailored to a patient's ch...
research
01/09/2020

Personalized Policy Learning using Longitudinal Mobile Health Data

We address the personalized policy learning problem using longitudinal m...
research
11/05/2021

Distilling Heterogeneity: From Explanations of Heterogeneous Treatment Effect Models to Interpretable Policies

Internet companies are increasingly using machine learning models to cre...
research
12/18/2018

Interpretable Optimal Stopping

Optimal stopping is the problem of deciding when to stop a stochastic sy...
research
06/20/2019

More Efficient Policy Learning via Optimal Retargeting

Policy learning can be used to extract individualized treatment regimes ...

Please sign up or login with your details

Forgot password? Click here to reset