A Parallel algorithm for X-Armed bandits

10/26/2015
by   Cheng Chen, et al.
0

The target of X-armed bandit problem is to find the global maximum of an unknown stochastic function f, given a finite budget of n evaluations. Recently, X-armed bandits have been widely used in many situations. Many of these applications need to deal with large-scale data sets. To deal with these large-scale data sets, we study a distributed setting of X-armed bandits, where m players collaborate to find the maximum of the unknown function. We develop a novel anytime distributed X-armed bandit algorithm. Compared with prior work on X-armed bandits, our algorithm uses a quite different searching strategy so as to fit distributed learning scenarios. Our theoretical analysis shows that our distributed algorithm is m times faster than the classical single-player algorithm. Moreover, the number of communication rounds of our algorithm is only logarithmic in mn. The numerical results show that our method can make effective use of every players to minimize the loss. Thus, our distributed approach is attractive and useful.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/27/2017

Max K-armed bandit: On the ExtremeHunter algorithm and beyond

This paper is devoted to the study of the max K-armed bandit problem, wh...
research
02/14/2020

Coordination without communication: optimal regret in two players multi-armed bandits

We consider two agents playing simultaneously the same stochastic three-...
research
01/31/2021

Quantifying the Global Support Network for Non-State Armed Groups (NAGs)

Human history has been shaped by armed conflicts. Rather than large-scal...
research
11/08/2021

An Instance-Dependent Analysis for the Cooperative Multi-Player Multi-Armed Bandit

We study the problem of information sharing and cooperation in Multi-Pla...
research
08/08/2023

Cooperative Multi-agent Bandits: Distributed Algorithms with Optimal Individual Regret and Constant Communication Costs

Recently, there has been extensive study of cooperative multi-agent mult...
research
03/27/2017

A Scale Free Algorithm for Stochastic Bandits with Bounded Kurtosis

Existing strategies for finite-armed stochastic bandits mostly depend on...
research
04/21/2021

Searching with Opponent-Awareness

We propose Searching with Opponent-Awareness (SOA), an approach to lever...

Please sign up or login with your details

Forgot password? Click here to reset