Functional Sequential Treatment Allocation with Covariates

01/29/2020
by   Anders Bredahl Kock, et al.
0

We consider a multi-armed bandit problem with covariates. Given a realization of the covariate vector, instead of targeting the treatment with highest conditional expectation, the decision maker targets the treatment which maximizes a general functional of the conditional potential outcome distribution, e.g., a conditional quantile, trimmed mean, or a socio-economic functional such as an inequality, welfare or poverty measure. We develop expected regret lower bounds for this problem, and construct a near minimax optimal assignment policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/27/2011

The multi-armed bandit problem with covariates

We consider a multi-armed bandit problem in a setting where each arm pro...
research
05/19/2020

Treatment recommendation with distributional targets

We study the problem of a decision maker who must provide the best possi...
research
09/06/2021

Efficient Learning of Optimal Individualized Treatment Rules for Heteroscedastic or Misspecified Treatment-Free Effect Models

Recent development in data-driven decision science has seen great advanc...
research
05/12/2018

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models

In this paper we consider the dynamic assortment selection problem under...
research
05/12/2015

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

The purpose of this paper is to provide further understanding into the s...
research
05/11/2022

Externally Valid Treatment Choice

We consider the problem of learning treatment (or policy) rules that are...
research
05/03/2018

Nonparametric Learning and Optimization with Covariates

Modern decision analytics frequently involves the optimization of an obj...

Please sign up or login with your details

Forgot password? Click here to reset