We study a sequential decision problem where the learner faces a sequenc...
We develop a meta-learning framework for simple regret minimization in b...
We consider the problem of finding, through adaptive sampling, which of ...
We study the problem of best-arm identification (BAI) in contextual band...