Making SMART decisions in prophylaxis and treatment studies

03/24/2022
by   Robert K. Mahar, et al.
0

The optimal prophylaxis, and treatment if the prophylaxis fails, for a disease may be best evaluated using a sequential multiple assignment randomised trial (SMART). A SMART is a multi-stage study that randomises a participant to an initial treatment, observes some response to that treatment and then, depending on their observed response, randomises the same participant to an alternative treatment. Response adaptive randomisation may, in some settings, improve the trial participants' outcomes and expedite trial conclusions, compared to fixed randomisation. But 'myopic' response adaptive randomisation strategies, blind to multistage dynamics, may also result in suboptimal treatment assignments. We propose a 'dynamic' response adaptive randomisation strategy based on Q-learning, an approximate dynamic programming algorithm. Q-learning uses stage-wise statistical models and backward induction to incorporate late-stage 'payoffs' (i.e. clinical outcomes) into early-stage 'actions' (i.e. treatments). Our real-world example consists of a COVID-19 prophylaxis and treatment SMART with qualitatively different binary endpoints at each stage. Standard Q-learning does not work with such data because it cannot be used for sequences of binary endpoints. Sequences of qualitatively distinct endpoints may also require different weightings to ensure that the design guides participants to regimens with the highest utility. We describe how a simple decision-theoretic extension to Q-learning can be used to handle sequential binary endpoints with distinct utilities. Using simulation we show that, under a set of binary utilities, the 'dynamic' approach increases expected participant utility compared to the fixed approach, sometimes markedly, for all model parameters, whereas the 'myopic' approach can actually decrease utility.

READ FULL TEXT

page 20

page 21

research
12/10/2020

Power prior models for treatment effect estimation in a small n, sequential, multiple assignment, randomized trial

A small n, sequential, multiple assignment, randomized trial (snSMART) i...
research
02/08/2022

Inferring Strategies from Observations in Long Iterated Prisoner's Dilemma Experiments

While many theoretical studies have revealed the strategies that could l...
research
09/01/2018

A Contextual-bandit-based Approach for Informed Decision-making in Clinical Trials

Clinical trials involving multiple treatments utilize randomization of t...
research
10/28/2022

SMART-EXAM: Incorporating Participants' Welfare into Sequential Multiple Assignment Randomized Trials

Dynamic treatment regimes (DTRs) are sequences of decision rules that re...
research
05/06/2020

DTR Bandit: Learning to Make Response-Adaptive Decisions With Low Regret

Dynamic treatment regimes (DTRs) for are personalized, sequential treatm...
research
03/17/2018

Power Analysis in a SMART Design: Sample Size Estimation for Determining the Best Dynamic Treatment Regime

Sequential, multiple assignment, randomized trial (SMART) designs have b...
research
02/25/2019

SMARTp: A SMART design for non-surgical treatments of chronic periodontitis with spatially-referenced and non-randomly missing skewed outcomes

This paper proposes dynamic treatment regimes for choosing individualize...

Please sign up or login with your details

Forgot password? Click here to reset