PAC Identification of Many Good Arms in Stochastic Multi-Armed Bandits

01/24/2019
by   Arghya Roy Chaudhuri, et al.
0

We consider the problem of identifying any k out of the best m arms in an n-armed stochastic multi-armed bandit. Framed in the PAC setting, this particular problem generalises both the problem of `best subset selection' and that of selecting `one out of the best m' arms [arcsk 2017]. In applications such as crowd-sourcing and drug-designing, identifying a single good solution is often not sufficient. Moreover, finding the best subset might be hard due to the presence of many indistinguishably close solutions. Our generalisation of identifying exactly k arms out of the best m, where 1 ≤ k ≤ m, serves as a more effective alternative. We present a lower bound on the worst-case sample complexity for general k, and a fully sequential PAC algorithm, , which is more sample-efficient on easy instances. Also, extending our analysis to infinite-armed bandits, we present a PAC algorithm that is independent of n, which identifies an arm from the best ρ fraction of arms using at most an additive poly-log number of samples than compared to the lower bound, thereby improving over [arcsk 2017] and [Aziz+AKA:2018]. The problem of identifying k > 1 distinct arms from the best ρ fraction is not always well-defined; for a special class of this problem, we present lower and upper bounds. Finally, through a reduction, we establish a relation between upper bounds for the `one out of the best ρ' problem for infinite instances and the `one out of the best m' problem for finite instances. We conjecture that it is more efficient to solve `small' finite instances using the latter formulation, rather than going through the former.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/17/2013

On Finding the Largest Mean Among Many

Sampling from distributions to find the one with the largest mean arises...
research
10/28/2018

Exploring k out of Top ρ Fraction of Arms in Stochastic Bandits

This paper studies the problem of identifying any k distinct arms among ...
research
02/13/2022

On the complexity of All ε-Best Arms Identification

We consider the problem introduced by <cit.> of identifying all the ε-op...
research
07/10/2023

SHAP@k:Efficient and Probably Approximately Correct (PAC) Identification of Top-k Features

The SHAP framework provides a principled method to explain the predictio...
research
10/30/2022

Revisiting Simple Regret Minimization in Multi-Armed Bandits

Simple regret is a natural and parameter-free performance criterion for ...
research
11/17/2021

Max-Min Grouped Bandits

In this paper, we introduce a multi-armed bandit problem termed max-min ...
research
06/15/2019

The True Sample Complexity of Identifying Good Arms

We consider two multi-armed bandit problems with n arms: (i) given an ϵ ...

Please sign up or login with your details

Forgot password? Click here to reset