Counterfactually Guided Policy Transfer in Clinical Settings

06/20/2020
by Taylor W. Killian, et al.

Reliably transferring treatment policies learned in one clinical environment to another is currently limited by challenges related to domain shift. In this paper we address off-policy learning for sequential decision making under domain shift, a scenario susceptible to catastrophic overconfidence, which is highly relevant to high-stakes clinical settings where the target domain may also be data-scarce. We propose a two-fold counterfactual regularization procedure to improve off-policy learning, addressing both domain shift and data scarcity. First, we use an informative prior derived from a data-rich source environment to indirectly improve the drawing of counterfactual example observations. These samples are then used to learn a policy for the target domain, regularized by the source policy through KL-divergence. In simulated sepsis treatment, our counterfactual policy transfer procedure significantly improves the performance of a learned treatment policy.
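
To make the KL-regularization step concrete, the following is a minimal sketch of one way such an objective could be written for tabular policies. It is not the authors' implementation: the function names, the weight `lam`, the use of per-state action values `q_values`, and the direction of the divergence (target relative to source) are all assumptions made for illustration.

```python
import math


def kl_divergence(p, q, eps=1e-12):
    """KL(p || q) for two discrete distributions over the same action set.

    eps guards against division by zero when q assigns zero probability.
    """
    return sum(
        pi * math.log((pi + eps) / (qi + eps))
        for pi, qi in zip(p, q)
        if pi > 0
    )


def regularized_objective(target_policy, source_policy, q_values, lam):
    """Average over states of expected value minus lam * KL(target || source).

    target_policy, source_policy: one action distribution per state.
    q_values: one list of action values per state (hypothetical estimates,
              e.g. obtained from counterfactual samples).
    lam: hypothetical regularization weight (an assumption, not from the paper).
    """
    total = 0.0
    for pi_t, pi_s, q in zip(target_policy, source_policy, q_values):
        expected_value = sum(prob * value for prob, value in zip(pi_t, q))
        total += expected_value - lam * kl_divergence(pi_t, pi_s)
    return total / len(target_policy)


if __name__ == "__main__":
    source = [[0.5, 0.5]]          # source-domain policy for a single state
    target = [[0.9, 0.1]]          # candidate target-domain policy
    values = [[1.0, 3.0]]          # illustrative action values
    print(regularized_objective(target, source, values, lam=1.0))
```

Under these assumptions, a larger `lam` keeps the target policy closer to the source policy, which is the intuition behind regularizing when target-domain data are scarce; `lam = 0` recovers the unregularized expected value.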

research
06/10/2022

Adversarial Counterfactual Environment Model Learning

A good model for action-effect prediction, named environment model, is i...
research
10/27/2021

Transfer learning with causal counterfactual reasoning in Decision Transformers

The ability to adapt to changes in environmental contingencies is an imp...
research
05/14/2019

Counterfactual Off-Policy Evaluation with Gumbel-Max Structural Causal Models

We introduce an off-policy evaluation procedure for highlighting episode...
research
05/04/2023

ReMask: A Robust Information-Masking Approach for Domain Counterfactual Generation

Domain shift is a big challenge in NLP, thus, many approaches resort to ...
research
06/06/2023

Counterfactual Explanations and Predictive Models to Enhance Clinical Decision-Making in Schizophrenia using Digital Phenotyping

Clinical practice in psychiatry is burdened with the increased demand fo...
research
07/01/2021

Policy Transfer across Visual and Dynamics Domain Gaps via Iterative Grounding

The ability to transfer a policy from one environment to another is a pr...
research
08/09/2018

Counterfactual Normalization: Proactively Addressing Dataset Shift and Improving Reliability Using Causal Mechanisms

Predictive models can fail to generalize from training to deployment env...
