Multiple Identifications in Multi-Armed Bandits

05/14/2012
by   Sébastien Bubeck, et al.
0

We study the problem of identifying the top m arms in a multi-armed bandit game. Our proposed solution relies on a new algorithm based on successive rejects of the seemingly bad arms, and successive accepts of the good ones. This algorithmic contribution allows to tackle other multiple identifications settings that were previously out of reach. In particular we show that this idea of successive accepts and rejects applies to the multi-bandit best arm identification problem.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/20/2023

Multi armed bandits and quantum channel oracles

Multi armed bandits are one of the theoretical pillars of reinforcement ...
research
11/15/2018

Pure-Exploration for Infinite-Armed Bandits with General Arm Reservoirs

This paper considers a multi-armed bandit game where the number of arms ...
research
07/14/2020

Generic Outlier Detection in Multi-Armed Bandit

In this paper, we study the problem of outlier arm detection in multi-ar...
research
07/25/2020

Multi-Armed Bandits for Minesweeper: Profiting from Exploration-Exploitation Synergy

A popular computer puzzle, the game of Minesweeper requires its human pl...
research
09/18/2019

Environmental Hotspot Identification in Limited Time with a UAV Equipped with a Downward-Facing Camera

We are motivated by environmental monitoring tasks where finding the glo...
research
01/28/2022

Networked Restless Multi-Armed Bandits for Mobile Interventions

Motivated by a broad class of mobile intervention problems, we propose a...
research
02/18/2022

Adaptivity and Confounding in Multi-Armed Bandit Experiments

We explore a new model of bandit experiments where a potentially nonstat...

Please sign up or login with your details

Forgot password? Click here to reset