Confidence Intervals for Policy Evaluation in Adaptive Experiments

11/07/2019
by   Vitor Hadad, et al.
14

Adaptive experiments can result in considerable cost savings in multi-armed trials by enabling analysts to quickly focus on the most promising alternatives. Most existing work on adaptive experiments (which include multi-armed bandits) has focused maximizing the speed at which the analyst can identify the optimal arm and/or minimizing the number of draws from sub-optimal arms. In many scientific settings, however, it is not only of interest to identify the optimal arm, but also to perform a statistical analysis of the data collected from the experiment. Naive approaches to statistical inference with adaptive inference fail because many commonly used statistics (such as sample means or inverse propensity weighting) do not have an asymptotically Gaussian limiting distribution centered on the estimate, and so confidence intervals constructed from these statistics do not have correct coverage. But, as shown in this paper, carefully designed data-adaptive weighting schemes can be used to overcome this issue and restore a relevant central limit theorem, enabling hypothesis testing. We validate the accuracy of the resulting confidence intervals in numerical experiments.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2023

Budgeted Multi-Armed Bandits with Asymmetric Confidence Intervals

We study the stochastic Budgeted Multi-Armed Bandit (MAB) problem, where...
research
02/27/2023

Design-Based Inference for Multi-arm Bandits

Multi-arm bandits are gaining popularity as they enable real-world seque...
research
02/08/2020

Inference for Batched Bandits

As bandit algorithms are increasingly utilized in scientific studies, th...
research
12/18/2017

Accurate Inference for Adaptive Linear Models

Estimators computed from adaptively collected data do not behave like th...
research
11/28/2018

Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals

This paper presents new deviation inequalities that are valid uniformly ...
research
06/21/2023

Qini Curves for Multi-Armed Treatment Rules

Qini curves have emerged as an attractive and popular approach for evalu...
research
02/17/2020

Are You Sure You're Sure? – Effects of Visual Representation on the Cliff Effect in Statistical Inference

Common reporting styles of statistical results, such as confidence inter...

Please sign up or login with your details

Forgot password? Click here to reset