Thompson Sampling under Bernoulli Rewards with Local Differential Privacy

07/03/2023
by   Bo Jiang, et al.
0

This paper investigates the problem of regret minimization for multi-armed bandit (MAB) problems with local differential privacy (LDP) guarantee. Given a fixed privacy budget ϵ, we consider three privatizing mechanisms under Bernoulli scenario: linear, quadratic and exponential mechanisms. Under each mechanism, we derive stochastic regret bound for Thompson Sampling algorithm. Finally, we simulate to illustrate the convergence of different mechanisms under different privacy budgets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2020

Multi-Armed Bandits with Local Differential Privacy

This paper investigates the problem of regret minimization for multi-arm...
research
05/29/2019

Differential Privacy for Multi-armed Bandits: What Is It and What Is Its Cost?

We introduce a number of privacy definitions for the multi-armed bandit ...
research
08/16/2017

Corrupt Bandits for Preserving Local Privacy

We study a variant of the stochastic multi-armed bandit (MAB) problem in...
research
08/04/2020

Verifying Pufferfish Privacy in Hidden Markov Models

Pufferfish is a Bayesian privacy framework for designing and analyzing p...
research
06/15/2018

Customized Local Differential Privacy for Multi-Agent Distributed Optimization

Real-time data-driven optimization and control problems over networks ma...
research
10/02/2017

Rényi Differential Privacy Mechanisms for Posterior Sampling

Using a recently proposed privacy definition of Rényi Differential Priva...
research
05/19/2023

Differentially Private Online Item Pricing

This work addresses the problem of revenue maximization in a repeated, u...

Please sign up or login with your details

Forgot password? Click here to reset