Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost?

05/29/2019
by   Debabrota Basu, et al.
0

We introduce a number of privacy definitions for the multi-armed bandit problem, based on differential privacy. We relate them through a unifying graphical model representation and connect them to existing definitions. We then derive and contrast lower bounds on the regret of bandit algorithms satisfying these definitions. We show that for all of them, the learner's regret is increased by a multiplicative factor dependent on the privacy level ϵ, but that the dependency is weaker when we do not require local differential privacy for the rewards.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2020

Multi-Armed Bandits with Local Differential Privacy

This paper investigates the problem of regret minimization for multi-arm...
research
07/03/2023

Thompson Sampling under Bernoulli Rewards with Local Differential Privacy

This paper investigates the problem of regret minimization for multi-arm...
research
09/01/2023

Interactive and Concentrated Differential Privacy for Bandits

Bandits play a crucial role in interactive learning schemes and modern r...
research
06/12/2022

Distributed Differential Privacy in Multi-Armed Bandits

We consider the standard K-armed bandit problem under a distributed trus...
research
08/16/2017

Corrupt Bandits for Preserving Local Privacy

We study a variant of the stochastic multi-armed bandit (MAB) problem in...
research
08/31/2022

Federated Online Clustering of Bandits

Contextual multi-armed bandit (MAB) is an important sequential decision-...
research
01/16/2017

Achieving Privacy in the Adversarial Multi-Armed Bandit

In this paper, we improve the previously best known regret bound to achi...

Please sign up or login with your details

Forgot password? Click here to reset