Optimal dynamic treatment allocation

05/28/2017
by Anders Bredahl Kock, et al.

In a treatment allocation problem, the individuals to be treated often arrive gradually. Initially, when the first treatments are assigned, little is known about their effects, but as more treatments are assigned the policy maker learns about them by observing outcomes. Thus, there is a tradeoff between exploring the available treatments to learn about their merits and exploiting the best treatment, i.e. administering it as often as possible, in order to maximise the cumulative welfare of all the assignments made. Furthermore, a policy maker may be interested not only in the expected effect of a treatment but also in its riskiness. Thus, we allow the welfare function to depend on the first and second moments of the distribution of treatment outcomes. We propose a dynamic treatment policy which attains the minimax optimal regret relative to the unknown best treatment in this dynamic setting. We allow the data to arrive in batches since, say, unemployment programs only start once a month, or blood samples are only sent to the laboratory for investigation in batches. Furthermore, we show that minimax optimality does not come at the price of overly aggressive experimentation, as we provide upper bounds on the expected number of times any suboptimal treatment is assigned. We also consider the case where the outcome of a treatment is only observed with delay, as it may take time for the treatment to work. Thus, a doctor faces a tradeoff between getting imprecise information quickly by making the measurement soon after the treatment is given, or getting precise information later at the expense of less information for the individuals who are treated in the meantime. Finally, using Danish register data, we show how our treatment policy can be used to assign unemployed individuals to active labor market policy programs in order to maximise the probability of ending the unemployment spell.
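To make the explore/exploit tradeoff with batched arrivals concrete, here is a minimal sketch in Python. It is not the policy proposed in the paper: it uses a generic upper-confidence-bound index as a stand-in, and the welfare function (sample mean minus `risk_aversion` times sample variance), the exploration bonus, and all function and parameter names are assumptions made purely for illustration.

```python
import math
import random


def empirical_welfare(outcomes, risk_aversion):
    """Plug-in welfare: sample mean minus risk_aversion times sample variance.

    This particular functional form is an assumption; the abstract only says
    that welfare may depend on the first and second moments.
    """
    n = len(outcomes)
    mean = sum(outcomes) / n
    variance = sum((y - mean) ** 2 for y in outcomes) / n
    return mean - risk_aversion * variance


def run_batched_policy(treatments, n_batches, batch_size,
                       risk_aversion=0.5, seed=0):
    """Assign treatments batch by batch; return assignment counts per treatment.

    `treatments` is a list of callables taking a random.Random instance and
    returning one outcome. All individuals in a batch receive the same
    treatment, chosen only from outcomes observed before the batch started.
    """
    rng = random.Random(seed)
    history = [[] for _ in treatments]

    # Initial exploration: observe each treatment once.
    for k, draw in enumerate(treatments):
        history[k].append(draw(rng))

    for _ in range(n_batches):
        total = sum(len(h) for h in history)
        # Generic UCB-style index: estimated welfare plus an exploration
        # bonus that shrinks as a treatment accumulates observations.
        indices = [
            empirical_welfare(h, risk_aversion)
            + math.sqrt(2.0 * math.log(total) / len(h))
            for h in history
        ]
        best = max(range(len(treatments)), key=indices.__getitem__)
        # Outcomes of the whole batch only become available afterwards.
        history[best].extend(treatments[best](rng) for _ in range(batch_size))

    return [len(h) for h in history]


if __name__ == "__main__":
    # Two hypothetical treatments with binary outcomes (e.g. spell ended or not).
    arms = [
        lambda rng: 1.0 if rng.random() < 0.3 else 0.0,
        lambda rng: 1.0 if rng.random() < 0.7 else 0.0,
    ]
    print(run_batched_policy(arms, n_batches=200, batch_size=5))
```

In this sketch the treatment with the higher welfare should end up receiving most assignments, while the other one is still tried occasionally because its exploration bonus grows when it is left unassigned.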


research
06/09/2021

Estimation of Optimal Dynamic Treatment Assignment Rules under Policy Constraint

This paper studies statistical decisions for dynamic treatment assignmen...
research
05/19/2020

Treatment recommendation with distributional targets

We study the problem of a decision maker who must provide the best possi...
research
02/03/2020

Improving Pest Monitoring Networks in order to reduce pesticide use in agriculture

Disease and pest control largely rely on pesticides use and progress sti...
research
11/12/2020

Learning Personalized Policy with Strategic Agents

There is increasing interest in allocating treatments based on observed ...
research
06/01/2022

(Machine) Learning What Policies Value

When a policy prioritizes one person over another, is it because they be...
research
08/09/2019

Minimax Crossover Designs

In crossover experiments, two broad types of treatment effects are typic...
research
05/25/2020

Fair Policy Targeting

One of the major concerns of targeting interventions on individuals in s...
