Diffusion Asymptotics for Sequential Experiments

01/25/2021
by Stefan Wager, et al.

We propose a new diffusion-asymptotic analysis for sequentially randomized experiments. Rather than taking the sample size n to infinity while keeping the problem parameters fixed, we let the mean signal level scale as 1/√n so as to preserve the difficulty of the learning task as n gets large. In this regime, we show that the behavior of a class of methods for sequential experimentation converges to a diffusion limit. This connection enables us to make sharp performance predictions and obtain new insights on the behavior of Thompson sampling. Our diffusion asymptotics also help resolve a discrepancy between the Θ(log n) regret predicted by fixed-parameter, large-sample asymptotics on the one hand, and the Θ(√n) regret from worst-case, finite-sample analysis on the other, suggesting that the diffusion scaling is an appropriate asymptotic regime for understanding practical large-scale sequential experiments.
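The scaling described in the abstract can be illustrated with a small simulation. The following is a minimal sketch, not the paper's method: it assumes a two-armed Gaussian bandit with unit-variance rewards, a standard normal prior on each arm mean, and a hypothetical function name `thompson_regret`. The only element taken from the abstract is that the gap between the arm means shrinks as 1/√n with the horizon n.

```python
import math
import random


def thompson_regret(n, delta, seed=0):
    """Cumulative regret of Gaussian Thompson sampling over n rounds.

    Assumptions (illustrative, not from the source): two arms, unit-variance
    Gaussian rewards, N(0, 1) prior on each arm mean. The better arm leads
    by delta / sqrt(n), so the gap shrinks as the horizon grows.
    """
    rng = random.Random(seed)
    means = [0.0, delta / math.sqrt(n)]  # arm 1 is better by delta / sqrt(n)
    counts = [0, 0]      # number of pulls per arm
    sums = [0.0, 0.0]    # sum of observed rewards per arm
    regret = 0.0
    for _ in range(n):
        # Posterior of each arm mean is N(sum / (count + 1), 1 / (count + 1)).
        draws = [
            rng.gauss(sums[k] / (counts[k] + 1), 1.0 / math.sqrt(counts[k] + 1))
            for k in range(2)
        ]
        arm = 0 if draws[0] > draws[1] else 1
        reward = rng.gauss(means[arm], 1.0)
        counts[arm] += 1
        sums[arm] += reward
        regret += means[1] - means[arm]  # zero when the better arm is pulled
    return regret


if __name__ == "__main__":
    # Print regret normalized by sqrt(n) for a few horizons.
    for n in (100, 1000, 10000):
        print(n, thompson_regret(n, delta=1.0) / math.sqrt(n))
```

Because each round costs at most delta/√n, the cumulative regret in this sketch can never exceed delta·√n; dividing by √n gives a quantity one can compare across horizons when experimenting with the scaling.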


