DeepAI AI Chat
Log In Sign Up

Functional Sequential Treatment Allocation with Covariates

by   Anders Bredahl Kock, et al.

We consider a multi-armed bandit problem with covariates. Given a realization of the covariate vector, instead of targeting the treatment with highest conditional expectation, the decision maker targets the treatment which maximizes a general functional of the conditional potential outcome distribution, e.g., a conditional quantile, trimmed mean, or a socio-economic functional such as an inequality, welfare or poverty measure. We develop expected regret lower bounds for this problem, and construct a near minimax optimal assignment policy.


page 1

page 2

page 3

page 4


The multi-armed bandit problem with covariates

We consider a multi-armed bandit problem in a setting where each arm pro...

Treatment recommendation with distributional targets

We study the problem of a decision maker who must provide the best possi...

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models

In this paper we consider the dynamic assortment selection problem under...

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

The purpose of this paper is to provide further understanding into the s...

Externally Valid Treatment Choice

We consider the problem of learning treatment (or policy) rules that are...

Nonparametric Learning and Optimization with Covariates

Modern decision analytics frequently involves the optimization of an obj...