Generalizing Off-Policy Evaluation From a Causal Perspective For Sequential Decision-Making

01/20/2022
by   Sonali Parbhoo, et al.
0

Assessing the effects of a policy based on observational data from a different policy is a common problem across several high-stake decision-making domains, and several off-policy evaluation (OPE) techniques have been proposed. However, these methods largely formulate OPE as a problem disassociated from the process used to generate the data (i.e. structural assumptions in the form of a causal graph). We argue that explicitly highlighting this association has important implications on our understanding of the fundamental limits of OPE. First, this implies that current formulation of OPE corresponds to a narrow set of tasks, i.e. a specific causal estimand which is focused on prospective evaluation of policies over populations or sub-populations. Second, we demonstrate how this association motivates natural desiderata to consider a general set of causal estimands, particularly extending the role of OPE for counterfactual off-policy evaluation at the level of individuals of the population. A precise description of the causal estimand highlights which OPE estimands are identifiable from observational data under the stated generative assumptions. For those OPE estimands that are not identifiable, the causal perspective further highlights where more experimental data is necessary, and highlights situations where human expertise can aid identification and estimation. Furthermore, many formalisms of OPE overlook the role of uncertainty entirely in the estimation process.We demonstrate how specifically characterising the causal estimand highlights the different sources of uncertainty and when human expertise can naturally manage this uncertainty. We discuss each of these aspects as actionable desiderata for future OPE research at scale and in-line with practical utility.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/20/2022

Counterfactual Learning with Multioutput Deep Kernels

In this paper, we address the challenge of performing counterfactual inf...
research
09/05/2023

s-ID: Causal Effect Identification in a Sub-Population

Causal inference in a sub-population involves identifying the causal eff...
research
04/08/2021

Causal Decision Making and Causal Effect Estimation Are Not the Same... and Why It Matters

Causal decision making (CDM) at scale has become a routine part of busin...
research
07/03/2018

Playing against Nature: causal discovery for decision making under uncertainty

We consider decision problems under uncertainty where the options availa...
research
05/14/2019

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models

We introduce an off-policy evaluation procedure for highlighting episode...
research
10/17/2019

Comment: Reflections on the Deconfounder

The aim of this comment (set to appear in a formal discussion in JASA) i...
research
11/28/2021

Identification of Subgroups With Similar Benefits in Off-Policy Policy Evaluation

Off-policy policy evaluation methods for sequential decision making can ...

Please sign up or login with your details

Forgot password? Click here to reset