Achieving Privacy in the Adversarial Multi-Armed Bandit

01/16/2017
by   Aristide C. Y. Tossou, et al.
0

In this paper, we improve the previously best known regret bound to achieve ϵ-differential privacy in oblivious adversarial bandits from O(T^2/3/ϵ) to O(√(T) T /ϵ). This is achieved by combining a Laplace Mechanism with EXP3. We show that though EXP3 is already differentially private, it leaks a linear amount of information in T. However, we can improve this privacy by relying on its intrinsic exponential mechanism for selecting actions. This allows us to reach O(√( T))-DP, with a regret of O(T^2/3) that holds against an adaptive adversary, an improvement from the best known of O(T^3/4). This is done by using an algorithm that run EXP3 in a mini-batch loop. Finally, we run experiments that clearly demonstrate the validity of our theoretical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/27/2017

The Price of Differential Privacy For Online Learning

We design differentially private algorithms for the problem of online li...
research
06/05/2021

Differentially Private Multi-Armed Bandits in the Shuffle Model

We give an (ε,δ)-differentially private algorithm for the multi-armed ba...
research
11/27/2015

Algorithms for Differentially Private Multi-Armed Bandits

We present differentially private algorithms for the stochastic Multi-Ar...
research
05/24/2021

Cascading Bandit under Differential Privacy

This paper studies differential privacy (DP) and local differential priv...
research
09/06/2022

When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

We study the problem of multi-armed bandits with ϵ-global Differential P...
research
05/29/2019

Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost?

We introduce a number of privacy definitions for the multi-armed bandit ...
research
08/30/2022

Dynamic Global Sensitivity for Differentially Private Contextual Bandits

Bandit algorithms have become a reference solution for interactive recom...

Please sign up or login with your details

Forgot password? Click here to reset