Collaboratively Learning the Best Option on Graphs, Using Bounded Local Memory

11/08/2018
by Lili Su, et al.

We consider multi-armed bandit problems in social groups wherein each individual has bounded memory and shares the common goal of learning the best arm/option. We say an individual learns the best option if eventually (as t → ∞) it pulls only the arm with the highest expected reward. While this goal is provably impossible for an isolated individual due to bounded memory, we show that, in social groups, it can be achieved easily with the aid of social persuasion (i.e., communication), as long as the communication networks/graphs satisfy some mild conditions. To deal with the interplay between the randomness in the rewards and in the social interaction, we employ the mean-field approximation method. Because the individuals in the network may not be exchangeable when the communication network is not a clique, we go beyond the classic mean-field techniques and apply a refined version of mean-field approximation: (1) Using coupling, we show that if the communication graph is connected and is either regular or has a doubly stochastic degree-weighted adjacency matrix, then with probability tending to 1 as the social group size N → ∞, every individual in the social group learns the best option. (2) If the minimum degree of the graph diverges as N → ∞, then over an arbitrary but given finite time horizon, the sample paths describing the opinion evolutions of the individuals are asymptotically independent. In addition, the proportions of the population with different opinions converge to the unique solution of a system of ODEs. In the solution of the obtained ODEs, the proportion of the population holding the correct opinion converges to 1 exponentially fast in time. Notably, our results hold even if the communication graphs are highly sparse.
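The graph condition in result (1) can be checked mechanically for a small example. The sketch below is illustrative only and is not taken from the paper: it assumes an undirected graph given as a 0/1 adjacency matrix, and it assumes that "degree-weighted adjacency matrix" means W = D^{-1}A, where D is the diagonal degree matrix. The abstract does not spell out this definition, and all function names here are hypothetical.

```python
from collections import deque


def degrees(adj):
    """Degree of each node in a 0/1 adjacency matrix."""
    return [sum(row) for row in adj]


def is_connected(adj):
    """Breadth-first search from node 0; True if every node is reached."""
    n = len(adj)
    if n == 0:
        return False
    seen = {0}
    queue = deque([0])
    while queue:
        u = queue.popleft()
        for v in range(n):
            if adj[u][v] and v not in seen:
                seen.add(v)
                queue.append(v)
    return len(seen) == n


def is_regular(adj):
    """True if all nodes have the same degree."""
    return len(set(degrees(adj))) == 1


def is_degree_weighted_doubly_stochastic(adj, tol=1e-9):
    """Assumed definition: W[i][j] = adj[i][j] / deg[i].

    Rows of W sum to 1 by construction, so only column sums are checked.
    """
    deg = degrees(adj)
    if any(d == 0 for d in deg):
        return False
    n = len(adj)
    col_sums = [sum(adj[i][j] / deg[i] for i in range(n)) for j in range(n)]
    return all(abs(s - 1.0) < tol for s in col_sums)


def satisfies_graph_condition(adj):
    """Connected, and either regular or doubly stochastic under the assumed W."""
    return is_connected(adj) and (
        is_regular(adj) or is_degree_weighted_doubly_stochastic(adj)
    )


# A 4-cycle: connected and 2-regular.
cycle4 = [
    [0, 1, 0, 1],
    [1, 0, 1, 0],
    [0, 1, 0, 1],
    [1, 0, 1, 0],
]

# A star with center 0: connected, but neither regular nor doubly stochastic.
star4 = [
    [0, 1, 1, 1],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
    [1, 0, 0, 0],
]

print(satisfies_graph_condition(cycle4))
print(satisfies_graph_condition(star4))
```

Under these assumptions the 4-cycle passes the check and the star does not, since in the star the column of W for the center sums to 3 rather than 1.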

research
02/22/2018

Collaboratively Learning the Best Option, Using Bounded Memory

We consider multi-armed bandit problems in social groups wherein each in...
research
01/24/2021

Mean-field Approximation for Stochastic Population Processes in Networks under Imperfect Information

This paper studies a general class of stochastic population processes in...
research
12/29/2022

Inference of interaction kernels in mean-field models of opinion dynamics

In models of opinion dynamics, many parameters – either in the form of c...
research
05/03/2021

Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward

Mean field game facilitates analyzing multi-armed bandit (MAB) for a lar...
research
08/19/2022

Almost Cost-Free Communication in Federated Best Arm Identification

We study the problem of best arm identification in a federated learning ...
research
11/17/2022

Cross-inhibition leads to group consensus despite the presence of strongly opinionated minorities and asocial behaviour

Strongly opinionated minorities can have a dramatic impact on the opinio...