Trajectory Inspection: A Method for Iterative Clinician-Driven Design of Reinforcement Learning Studies

10/08/2020
by   Christina X. Ji, et al.
9

Treatment policies learned via reinforcement learning (RL) from observational health data are sensitive to subtle choices in study design. We highlight a simple approach, trajectory inspection, to bring clinicians into an iterative design process for model-based RL studies. We inspect trajectories where the model recommends unexpectedly aggressive treatments or believes its recommendations would lead to much more positive outcomes. Then, we examine clinical trajectories simulated with the learned model and policy alongside the actual hospital course to uncover possible modeling issues. To demonstrate that this approach yields insights, we apply it to recent work on RL for inpatient sepsis management. We find that a design choice around maximum trajectory length leads to a model bias towards discharge, that the RL policy preference for high vasopressor doses may be linked to small sample sizes, and that the model has a clinically implausible expectation of discharge without weaning off vasopressors.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2019

Combining Benefits from Trajectory Optimization and Deep Reinforcement Learning

Recent breakthroughs both in reinforcement learning and trajectory optim...
research
01/09/2021

Identifying Decision Points for Safe and Interpretable Reinforcement Learning in Hypotension Treatment

Many batch RL health applications first discretize time into fixed inter...
research
11/17/2020

Explaining Conditions for Reinforcement Learning Behaviors from Real and Imagined Data

The deployment of reinforcement learning (RL) in the real world comes wi...
research
01/09/2020

Identifying Distinct, Effective Treatments for Acute Hypotension with SODA-RL: Safely Optimized Diverse Accurate Reinforcement Learning

Hypotension in critical care settings is a life-threatening emergency th...
research
05/31/2018

Evaluating Reinforcement Learning Algorithms in Observational Health Settings

Much attention has been devoted recently to the development of machine l...
research
11/29/2022

Learning and Understanding a Disentangled Feature Representation for Hidden Parameters in Reinforcement Learning

Hidden parameters are latent variables in reinforcement learning (RL) en...
research
11/29/2022

Symmetry Detection in Trajectory Data for More Meaningful Reinforcement Learning Representations

Knowledge of the symmetries of reinforcement learning (RL) systems can b...

Please sign up or login with your details

Forgot password? Click here to reset