Private Q-Learning with Functional Noise in Continuous Spaces

01/30/2019
by Baoxiang Wang, et al.

We consider privacy-preserving algorithms for deep reinforcement learning. State-of-the-art methods that guarantee differential privacy do not extend to very large state spaces, because the noise level needed to ensure privacy would grow without bound. We address the problem of providing differential privacy in Q-learning where a neural network is used as the function approximator. We develop a rigorous and efficient algorithm by inspecting the reproducing kernel Hilbert space in which the neural network is embedded. Our approach uses functional noise to guarantee privacy, while the noise level scales linearly with the complexity of the neural network architecture. There are no known theoretical guarantees on the performance of deep reinforcement learning, but we gain some insight by providing a utility analysis in the discrete space setting.
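To make the idea of functional noise concrete, the following is a minimal sketch rather than the algorithm from the paper. It perturbs a Q-function evaluated on a one-dimensional state grid with a single sample path of a Gaussian process, so the noise is a correlated function over states instead of independent noise per state. The exponential kernel, the `length_scale`, the toy Q-function, and the names `exponential_kernel` and `sample_functional_noise` are all assumptions made for illustration. The noise scale `sigma` is not calibrated to any privacy budget here, so the sketch by itself provides no differential privacy guarantee.

```python
import numpy as np


def exponential_kernel(x, y, length_scale=0.2):
    """Exponential kernel k(x, y) = exp(-|x - y| / length_scale).

    Assumption: this kernel is chosen only for illustration.
    """
    return np.exp(-np.abs(x[:, None] - y[None, :]) / length_scale)


def sample_functional_noise(states, sigma, rng, jitter=1e-8):
    """Draw one Gaussian process sample path evaluated at `states`.

    `sigma` scales the noise; it is a free parameter in this sketch and
    is NOT derived from a privacy budget.
    """
    K = exponential_kernel(states, states)
    # A small jitter keeps the Cholesky factorization numerically stable.
    L = np.linalg.cholesky(K + jitter * np.eye(len(states)))
    return sigma * (L @ rng.standard_normal(len(states)))


# Hypothetical Q-values for one action on a grid over a continuous state space.
states = np.linspace(0.0, 1.0, 50)
q_values = np.sin(2.0 * np.pi * states)

rng = np.random.default_rng(0)
noisy_q_values = q_values + sample_functional_noise(states, sigma=0.5, rng=rng)
```

Because neighbouring states receive correlated noise, the perturbed Q-function stays a function of the state, and the noise does not have to be drawn separately for every state in an arbitrarily large space.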


