Mixture Martingales Revisited with Applications to Sequential Tests and Confidence Intervals

11/28/2018
by Emilie Kaufmann, et al.

This paper presents new deviation inequalities that are valid uniformly in time under adaptive sampling in a multi-armed bandit model. The deviations are measured using the Kullback-Leibler divergence in a given one-dimensional exponential family, and may take into account several arms at a time. They are obtained by constructing for each arm a mixture martingale based on a hierarchical prior, and by multiplying those martingales. Our deviation inequalities allow us to analyze stopping rules based on generalized likelihood ratios for a large class of sequential identification problems, and to construct tight confidence intervals for some functions of the means of the arms.
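To make the idea of a mixture martingale concrete, here is a minimal sketch for a single arm. It is not the hierarchical-prior construction of the paper: it assumes Gaussian observations with unit variance and mixes the likelihood ratio over a Gaussian prior with precision `rho`, the classical normal mixture. The function names and the parameter `rho` are hypothetical, chosen only for illustration. Under the null mean `mu0`, the mixture is a nonnegative martingale with initial value 1, so Ville's inequality bounds the probability that it ever exceeds `1/delta` by `delta`, which yields a test and a confidence radius valid uniformly in time.

```python
import math


def log_mixture_martingale(s: float, t: int, rho: float = 1.0) -> float:
    """Log of the normal-mixture martingale after t unit-variance observations.

    s is the centered sum: sum of (x_i - mu0) over the first t samples.
    Mixing exp(lam * s - t * lam**2 / 2) over lam ~ N(0, 1/rho) gives
    sqrt(rho / (t + rho)) * exp(s**2 / (2 * (t + rho))).
    """
    return 0.5 * math.log(rho / (t + rho)) + s * s / (2.0 * (t + rho))


def confidence_radius(t: int, delta: float, rho: float = 1.0) -> float:
    """Radius r such that |empirical mean - mu0| <= r for all t >= 1
    with probability at least 1 - delta (illustrative Gaussian setting only)."""
    if t < 1:
        raise ValueError("t must be at least 1")
    width = 2.0 * math.log(1.0 / delta) + math.log((t + rho) / rho)
    return math.sqrt((t + rho) * width) / t


def first_rejection_time(samples, mu0: float, delta: float, rho: float = 1.0):
    """Return the first time the mixture martingale crosses 1/delta, or None."""
    s = 0.0
    threshold = math.log(1.0 / delta)
    for t, x in enumerate(samples, start=1):
        s += x - mu0  # update the centered sum
        if log_mixture_martingale(s, t, rho) >= threshold:
            return t
    return None
```

The radius is obtained by inverting the threshold condition: when the centered sum equals `t * confidence_radius(t, delta, rho)`, the log-martingale equals `log(1/delta)` exactly. Because the guarantee holds simultaneously for all `t`, the stopping time may depend on the data, which is the property that makes this kind of bound usable under adaptive sampling.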

research
06/24/2019

Sequential estimation of quantiles with applications to A/B-testing and best-arm identification

Consider the problem of sequentially estimating quantiles of any distrib...
research
06/21/2023

Qini Curves for Multi-Armed Treatment Rules

Qini curves have emerged as an attractive and popular approach for evalu...
research
11/07/2019

Confidence Intervals for Policy Evaluation in Adaptive Experiments

Adaptive experiments can result in considerable cost savings in multi-ar...
research
10/16/2020

Nonparametric iterated-logarithm extensions of the sequential generalized likelihood ratio test

We develop a nonparametric extension of the sequential generalized likel...
research
06/09/2020

Near-Optimal Confidence Sequences for Bounded Random Variables

Many inference problems, such as sequential decision problems like A/B t...
research
12/11/2017

Optimal Odd Arm Identification with Fixed Confidence

The problem of detecting an odd arm from a set of K arms of a multi-arme...
research
07/01/2023

Adaptive Algorithms for Relaxed Pareto Set Identification

In this paper we revisit the fixed-confidence identification of the Pare...
