Privacy-Preserving Policy Synthesis in Markov Decision Processes

04/16/2020
by Parham Gohari, et al.

In decision-making problems, the actions of an agent may reveal sensitive information that drives its decisions. For instance, a corporation's investment decisions may reveal its sensitive knowledge about market dynamics. To prevent this type of information leakage, we introduce a policy synthesis algorithm that protects the privacy of the transition probabilities in a Markov decision process. We use differential privacy as the mathematical definition of privacy. The algorithm first perturbs the transition probabilities using a mechanism that provides differential privacy. Then, based on the privatized transition probabilities, we synthesize a policy using dynamic programming. Our main contribution is to bound the "cost of privacy," i.e., the difference between the expected total rewards with privacy and the expected total rewards without privacy. We also show that computing the cost of privacy has time complexity that is polynomial in the parameters of the problem. Moreover, we establish that the cost of privacy increases with the strength of differential privacy protections, and we quantify this increase. Finally, numerical experiments on two example environments validate the established relationship between the cost of privacy and the strength of data privacy protections.
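The two-step pipeline described in the abstract (privatize the transition probabilities, then run dynamic programming on the privatized model) can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the abstract: the perturbation is a Dirichlet-style resampling with a hypothetical concentration parameter `k` (larger `k` means less noise), the MDP is a made-up two-state, two-action example with a finite horizon, and the names `dirichlet_perturb`, `backward_induction`, and `evaluate` are hypothetical. The sketch does not show how `k` maps to a formal differential privacy guarantee.

```python
import random


def dirichlet_perturb(p, k, rng):
    """Draw a noisy distribution from Dirichlet(k * p).

    Assumes every entry of p is strictly positive. Larger k concentrates
    the sample around p (less noise).
    """
    draws = [rng.gammavariate(k * pi, 1.0) for pi in p]
    total = sum(draws)
    return [d / total for d in draws]


def backward_induction(P, R, horizon):
    """Finite-horizon dynamic programming; returns (policy, values).

    P[s][a][t] is the probability of moving from s to t under action a,
    R[s][a] is the immediate reward, policy[step][s] is the chosen action.
    """
    n_states, n_actions = len(P), len(P[0])
    V = [0.0] * n_states
    policy = []
    for _ in range(horizon):
        Q = [[R[s][a] + sum(P[s][a][t] * V[t] for t in range(n_states))
              for a in range(n_actions)] for s in range(n_states)]
        step = [max(range(n_actions), key=lambda a: Q[s][a])
                for s in range(n_states)]
        V = [Q[s][step[s]] for s in range(n_states)]
        policy.insert(0, step)  # steps are computed last-to-first
    return policy, V


def evaluate(P, R, policy):
    """Expected total reward of a time-varying policy under model P."""
    n_states = len(P)
    V = [0.0] * n_states
    for step in reversed(policy):
        V = [R[s][a] + sum(P[s][a][t] * V[t] for t in range(n_states))
             for s, a in enumerate(step)]
    return V


# Hypothetical toy MDP: 2 states, 2 actions.
P = [[[0.9, 0.1], [0.2, 0.8]],
     [[0.7, 0.3], [0.1, 0.9]]]
R = [[1.0, 0.0],
     [0.0, 2.0]]
horizon = 5
rng = random.Random(0)

# Step 1: privatize each transition distribution independently.
P_private = [[dirichlet_perturb(row, k=50.0, rng=rng) for row in state]
             for state in P]

# Step 2: synthesize a policy from the privatized model only.
policy_private, _ = backward_induction(P_private, R, horizon)

# Cost of privacy from initial state 0: the optimal value on the true
# model minus the value the private policy achieves on the true model.
_, V_optimal = backward_induction(P, R, horizon)
V_private = evaluate(P, R, policy_private)
cost = V_optimal[0] - V_private[0]
print("cost of privacy from state 0:", cost)
```

Because the private policy is evaluated on the true model, its value can never exceed the optimal value, so `cost` is nonnegative by construction. Rerunning with smaller `k` adds more noise and tends to increase the cost, which mirrors the qualitative relationship stated in the abstract.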


