Privacy-Preserving Communication-Efficient Federated Multi-Armed Bandits

11/02/2021
by   Tan Li, et al.
0

Communication bottleneck and data privacy are two critical concerns in federated multi-armed bandit (MAB) problems, such as situations in decision-making and recommendations of connected vehicles via wireless. In this paper, we design the privacy-preserving communication-efficient algorithm in such problems and study the interactions among privacy, communication and learning performance in terms of the regret. To be specific, we design privacy-preserving learning algorithms and communication protocols and derive the learning regret when networked private agents are performing online bandit learning in a master-worker, a decentralized and a hybrid structure. Our bandit learning algorithms are based on epoch-wise sub-optimal arm eliminations at each agent and agents exchange learning knowledge with the server/each other at the end of each epoch. Furthermore, we adopt the differential privacy (DP) approach to protect the data privacy at each agent when exchanging information; and we curtail communication costs by making less frequent communications with fewer agents participation. By analyzing the regret of our proposed algorithmic framework in the master-worker, decentralized and hybrid structures, we theoretically show tradeoffs between regret and communication costs/privacy. Finally, we empirically show these trade-offs which are consistent with our theoretical analysis.

READ FULL TEXT
research
08/31/2022

Federated Online Clustering of Bandits

Contextual multi-armed bandit (MAB) is an important sequential decision-...
research
01/30/2023

Private Node Selection in Personalized Decentralized Learning

In this paper, we propose a novel approach for privacy-preserving node s...
research
10/24/2020

Federated Bandit: A Gossiping Approach

In this paper, we study Federated Bandit, a decentralized Multi-Armed Ba...
research
01/28/2023

Decentralized Entropic Optimal Transport for Privacy-preserving Distributed Distribution Comparison

Privacy-preserving distributed distribution comparison measures the dist...
research
05/14/2020

Federated Recommendation System via Differential Privacy

In this paper, we are interested in what we term the federated private b...
research
10/23/2020

Graph-Homomorphic Perturbations for Private Decentralized Learning

Decentralized algorithms for stochastic optimization and learning rely o...
research
06/27/2022

Differentially Private Federated Combinatorial Bandits with Constraints

There is a rapid increase in the cooperative learning paradigm in online...

Please sign up or login with your details

Forgot password? Click here to reset