DeepAI AI Chat
Log In Sign Up

Functional Sequential Treatment Allocation with Covariates

01/29/2020
by   Anders Bredahl Kock, et al.
0

We consider a multi-armed bandit problem with covariates. Given a realization of the covariate vector, instead of targeting the treatment with highest conditional expectation, the decision maker targets the treatment which maximizes a general functional of the conditional potential outcome distribution, e.g., a conditional quantile, trimmed mean, or a socio-economic functional such as an inequality, welfare or poverty measure. We develop expected regret lower bounds for this problem, and construct a near minimax optimal assignment policy.

READ FULL TEXT

page 1

page 2

page 3

page 4

10/27/2011

The multi-armed bandit problem with covariates

We consider a multi-armed bandit problem in a setting where each arm pro...
05/19/2020

Treatment recommendation with distributional targets

We study the problem of a decision maker who must provide the best possi...
05/12/2018

Near-Optimal Policies for Dynamic Multinomial Logit Assortment Selection Models

In this paper we consider the dynamic assortment selection problem under...
05/12/2015

Asymptotic Behavior of Minimal-Exploration Allocation Policies: Almost Sure, Arbitrarily Slow Growing Regret

The purpose of this paper is to provide further understanding into the s...
05/11/2022

Externally Valid Treatment Choice

We consider the problem of learning treatment (or policy) rules that are...
05/03/2018

Nonparametric Learning and Optimization with Covariates

Modern decision analytics frequently involves the optimization of an obj...