When Privacy Meets Partial Information: A Refined Analysis of Differentially Private Bandits

09/06/2022
by   Achraf Azize, et al.

We study the problem of multi-armed bandits with ϵ-global Differential Privacy (DP). First, we prove minimax and problem-dependent regret lower bounds for stochastic and linear bandits that quantify the hardness of bandits with ϵ-global DP. These bounds suggest the existence of two hardness regimes depending on the privacy budget ϵ. In the high-privacy regime (small ϵ), the hardness depends on a coupled effect of privacy and partial information about the reward distributions. In the low-privacy regime (large ϵ), bandits with ϵ-global DP are no harder than bandits without privacy. For stochastic bandits, we further propose a generic framework to design a near-optimal ϵ-global DP extension of an index-based optimistic bandit algorithm. The framework consists of three ingredients: the Laplace mechanism, arm-dependent adaptive episodes, and the use of only the rewards collected in the last episode for computing private statistics. Specifically, we instantiate ϵ-global DP extensions of the UCB and KL-UCB algorithms, namely AdaP-UCB and AdaP-KLUCB. AdaP-KLUCB is the first algorithm that both satisfies ϵ-global DP and yields a regret upper bound that matches the problem-dependent lower bound up to multiplicative constants.
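The three ingredients named in the abstract can be illustrated with a rough sketch. This is not the paper's AdaP-UCB: the function names, the doubling episode schedule, the Bernoulli rewards in [0, 1], and the form of the exploration bonus are all assumptions made for illustration. Only the Laplace step relies on a standard fact: the mean of n rewards in [0, 1] changes by at most 1/n when one reward changes, so Laplace noise of scale 1/(n ϵ) makes that single release ϵ-DP.

```python
import numpy as np

def private_episode_mean(rewards, epsilon, rng):
    """Laplace-noised mean of rewards assumed to lie in [0, 1]."""
    n = len(rewards)
    # One changed reward moves the mean by at most 1/n (its sensitivity),
    # so Laplace noise with scale 1/(n * epsilon) suffices for this release.
    noise = rng.laplace(loc=0.0, scale=1.0 / (n * epsilon))
    return float(np.mean(rewards)) + noise

def run_sketch(means, epsilon, horizon, seed=0):
    """Hypothetical episodic optimistic loop; returns pulls per arm."""
    rng = np.random.default_rng(seed)
    k = len(means)
    episode_len = [1] * k   # assumed schedule: each arm's episodes double
    index = [np.inf] * k    # inf forces one initial episode per arm
    pulls = [0] * k
    t = 0
    while t < horizon:
        arm = int(np.argmax(index))
        n = min(episode_len[arm], horizon - t)
        # Play the chosen arm for a whole episode (Bernoulli rewards assumed).
        rewards = rng.binomial(1, means[arm], size=n)
        t += n
        pulls[arm] += n
        # Only the rewards of this last episode enter the private statistic.
        mean_hat = private_episode_mean(rewards, epsilon, rng)
        log_t = np.log(max(t, 2))
        # Assumed bonus: a UCB-style term plus a term covering the Laplace noise.
        bonus = np.sqrt(log_t / (2 * n)) + log_t / (n * epsilon)
        index[arm] = mean_hat + bonus
        episode_len[arm] *= 2
    return pulls
```

For example, `run_sketch([0.2, 0.8], epsilon=1.0, horizon=500)` returns the number of pulls per arm. Discarding earlier episodes means each reward influences only one noisy release, which is why a single Laplace draw per episode is plausible here; the exact index and constants should be taken from the full text.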


research
09/01/2023

Interactive and Concentrated Differential Privacy for Bandits

Bandits play a crucial role in interactive learning schemes and modern r...
research
09/05/2023

On the Complexity of Differentially Private Best-Arm Identification with Fixed Confidence

Best Arm Identification (BAI) problems are progressively used for data-s...
research
06/01/2020

(Locally) Differentially Private Combinatorial Semi-Bandits

In this paper, we study Combinatorial Semi-Bandits (CSB) that is an exte...
research
07/12/2022

Differentially Private Linear Bandits with Partial Distributed Feedback

In this paper, we study the problem of global reward maximization with o...
research
06/04/2021

Optimal Rates of (Locally) Differentially Private Heavy-tailed Multi-Armed Bandits

In this paper we study the problem of stochastic multi-armed bandits (MA...
research
01/16/2017

Achieving Privacy in the Adversarial Multi-Armed Bandit

In this paper, we improve the previously best known regret bound to achi...
research
06/01/2020

Locally Differentially Private (Contextual) Bandits Learning

We study locally differentially private (LDP) bandits learning in this p...
