Distributed Differential Privacy in Multi-Armed Bandits

06/12/2022
by Sayak Ray Chowdhury, et al.

We consider the standard K-armed bandit problem under a distributed trust model of differential privacy (DP), which makes it possible to guarantee privacy without a trustworthy server. Under this trust model, previous work largely focuses on achieving privacy using a shuffle protocol, where a batch of users' data is randomly permuted before being sent to a central server. This protocol achieves an (ϵ,δ), or approximate-DP, guarantee by sacrificing an additional additive O(K log T √(log(1/δ))/ϵ) cost in T-step cumulative regret. In contrast, the optimal privacy cost for achieving a stronger (ϵ,0), or pure-DP, guarantee under the widely used central trust model is only Θ(K log T/ϵ); that model, however, requires a trusted server. In this work, we aim to obtain a pure-DP guarantee under the distributed trust model while sacrificing no more regret than under the central trust model. We achieve this by designing a generic bandit algorithm based on successive arm elimination, where privacy is guaranteed by corrupting rewards with an equivalent discrete Laplace noise, which is ensured by a secure computation protocol. We also show that our algorithm, when instantiated with Skellam noise and the secure protocol, ensures Rényi differential privacy (a stronger notion than approximate DP) under the distributed trust model with a privacy cost of O(K√(log T)/ϵ).
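To make the idea of successive arm elimination with discrete Laplace noise concrete, here is a minimal Python sketch. It is not the authors' algorithm: it simulates only the net effect of the noise on each batch sum and does not implement the secure computation protocol or the per-user noise generation. The function names, the doubling batch schedule, the binary rewards (so that one user changes a batch sum by at most 1), and the confidence radius are all assumptions chosen for illustration.

```python
import math
import random


def discrete_laplace(eps, rng):
    """Integer noise with P(X = k) proportional to exp(-eps * |k|)."""

    def geometric():
        # Inverse-transform sample of a geometric variable counting
        # failures before the first success: P(G >= k) = exp(-eps * k).
        u = 1.0 - rng.random()  # lies in (0, 1], so log(u) is finite
        return int(-math.log(u) / eps)

    # The difference of two i.i.d. geometric variables is discrete Laplace.
    return geometric() - geometric()


def private_successive_elimination(means, horizon, eps, seed=0):
    """Hypothetical sketch: batched arm elimination on noisy reward sums.

    Assumes Bernoulli rewards and a fresh user for every pull, so each
    batch sum has sensitivity 1 and is perturbed exactly once.
    """
    rng = random.Random(seed)
    active = list(range(len(means)))
    pulls = [0] * len(means)
    estimates = {}
    t, batch_size = 0, 2

    while len(active) > 1 and t + batch_size * len(active) <= horizon:
        for arm in active:
            # True batch sum of binary rewards for this arm.
            total = sum(rng.random() < means[arm] for _ in range(batch_size))
            # Only the noisy sum is visible to the learner.
            noisy_total = total + discrete_laplace(eps, rng)
            estimates[arm] = noisy_total / batch_size
            pulls[arm] += batch_size
            t += batch_size

        # Illustrative radius (an assumption, not the paper's constants):
        # a sampling term plus a term for the noise of scale 1/eps.
        radius = (math.sqrt(math.log(horizon) / (2 * batch_size))
                  + math.log(horizon) / (eps * batch_size))
        best = max(estimates[a] for a in active)
        active = [a for a in active if estimates[a] + 2 * radius >= best]
        batch_size *= 2

    # Spend any remaining budget on the surviving arm that looks best.
    leader = max(active, key=lambda a: estimates.get(a, 0.0))
    pulls[leader] += horizon - t
    return active, pulls


if __name__ == "__main__":
    survivors, counts = private_successive_elimination(
        [0.9, 0.5, 0.1], horizon=20000, eps=1.0
    )
    print("surviving arms:", survivors)
    print("pulls per arm:", counts)
```

Because each batch sum receives noise of scale 1/ϵ only once and batches double in size, the noise contribution to each estimate shrinks quickly, which is the intuition behind a privacy cost that is additive in the regret rather than multiplicative.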

research
02/11/2022

Shuffle Private Linear Contextual Bandits

Differential privacy (DP) has been recently introduced to linear context...
research
05/24/2021

Cascading Bandit under Differential Privacy

This paper studies differential privacy (DP) and local differential priv...
research
05/29/2019

Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost?

We introduce a number of privacy definitions for the multi-armed bandit ...
research
06/08/2021

Private Counting from Anonymous Messages: Near-Optimal Accuracy with Vanishing Communication Overhead

Differential privacy (DP) is a formal notion for quantifying the privacy...
research
02/05/2020

Pure Differentially Private Summation from Anonymous Messages

The shuffled (aka anonymous) model has recently generated significant in...
research
08/31/2022

Federated Online Clustering of Bandits

Contextual multi-armed bandit (MAB) is an important sequential decision-...
research
01/31/2022

Aggregation and Transformation of Vector-Valued Messages in the Shuffle Model of Differential Privacy

Advances in communications, storage and computational technology allow s...
