An optimal learning method for developing personalized treatment regimes

07/06/2016
by Yingfei Wang, et al.

A treatment regime is a function that maps individual patient information to a recommended treatment, and so explicitly accounts for heterogeneity in treatment needs across individuals. Patient responses are dichotomous and can be predicted through an unknown relationship that depends on the patient information and the selected treatment. The goal is to find the treatments that lead to the best patient responses on average. Because each experiment is expensive, we must learn as much as possible from each one. We adopt a Bayesian approach both to incorporate possible prior information and to update our treatment regime continuously as information accrues, which could allow smaller yet more informative trials and better treatment for patients. Formulating the problem as a contextual bandit, we introduce a knowledge gradient policy that guides treatment assignment by maximizing the expected value of information, using an approximation method to overcome the computational challenges. We provide a detailed study, on a real-world knee replacement dataset, of how to make sequential medical decisions under uncertainty to reduce health care costs. We use clustering and LASSO to deal with the intrinsic sparsity in health datasets. We show experimentally that even though the problem is sparse, careful selection of physicians (versus picking them at random) can significantly improve success rates.
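To make the idea of "maximizing the expected value of information" concrete, the following is a minimal sketch of a one-step knowledge gradient calculation for binary responses. It is not the paper's method: it assumes independent Beta-Bernoulli beliefs per treatment within a single patient cluster, which permits an exact calculation, whereas the abstract describes an approximation for a richer model. The names `BetaArm`, `knowledge_gradient`, and `choose_treatment` are hypothetical and introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass
class BetaArm:
    """Beta(alpha, beta) belief about one treatment's success probability.

    Assumption: treatments are modeled independently within one patient
    cluster; this is a simplification, not the model from the paper.
    """
    alpha: float = 1.0
    beta: float = 1.0

    @property
    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)

    def update(self, success: bool) -> None:
        # Conjugate Bayesian update after observing one binary response.
        if success:
            self.alpha += 1.0
        else:
            self.beta += 1.0


def knowledge_gradient(arms: list[BetaArm], i: int) -> float:
    """Expected one-step increase in the best posterior mean from trying arm i."""
    current_best = max(a.mean for a in arms)
    best_other = max((a.mean for j, a in enumerate(arms) if j != i), default=0.0)
    arm = arms[i]
    n = arm.alpha + arm.beta
    p_success = arm.mean
    mean_if_success = (arm.alpha + 1.0) / (n + 1.0)
    mean_if_failure = arm.alpha / (n + 1.0)
    # Average the best achievable posterior mean over the two possible outcomes.
    expected_best = (p_success * max(best_other, mean_if_success)
                     + (1.0 - p_success) * max(best_other, mean_if_failure))
    return expected_best - current_best


def choose_treatment(arms: list[BetaArm]) -> int:
    """Pick the treatment whose trial is expected to be most informative."""
    scores = [knowledge_gradient(arms, i) for i in range(len(arms))]
    return max(range(len(arms)), key=scores.__getitem__)


# Two treatments with the same estimated success rate (0.5) but different
# amounts of evidence behind the estimate.
arms = [BetaArm(1.0, 1.0), BetaArm(5.0, 5.0)]
chosen = choose_treatment(arms)
arms[chosen].update(success=True)
```

In this example both treatments have the same posterior mean, but the less-explored one has the larger knowledge gradient, so it is tried first. A contextual version under the same assumptions could keep one such list of arms per patient cluster.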


research
02/18/2022

Using Pilot Data to Size Observational Studies for the Estimation of Dynamic Treatment Regimes

There has been significant attention given to developing data-driven met...
research
11/10/2016

Estimating Dynamic Treatment Regimes in Mobile Health Using V-learning

The vision for precision medicine is to use individual patient character...
research
07/02/2020

Learning Individualized Treatment Rules with Estimated Translated Inverse Propensity Score

Randomized controlled trials typically analyze the effectiveness of trea...
research
05/29/2023

Contextual Bandits with Budgeted Information Reveal

Contextual bandit algorithms are commonly used in digital health to reco...
research
09/13/2017

Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks

We consider the problem of sequentially making decisions that are reward...
research
03/08/2022

Average Response Curves for Treatment Time in the Emergency Department

We estimate average response curves for treatment time in the Emergency...
research
05/10/2023

Planning a Community Approach to Diabetes Care in Low- and Middle-Income Countries Using Optimization

Diabetes is a global health priority, especially in low- and middle-inco...
