Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

11/13/2017

∙

We analyze the sample complexity of the thresholding bandit problem, with and without the assumption that the mean values of the arms are increasing. In each case, we provide a lower bound valid for any risk δ and any δ-correct algorithm; in addition, we propose an algorithm whose sample complexity is of the same order of magnitude for small risks. This work is motivated by phase 1 clinical trials, a practically important setting where the arm means are increasing by nature, and where no satisfactory solution is available so far.

READ FULL TEXT

Thresholding Bandit for Dose-ranging: The Impact of Monotonicity

Sign in with Google

Consider DeepAI Pro