Adaptive Control of Differentially Private Linear Quadratic Systems

08/26/2021
by   Sayak Ray Chowdhury, et al.
0

In this paper, we study the problem of regret minimization in reinforcement learning (RL) under differential privacy constraints. This work is motivated by the wide range of RL applications for providing personalized service, where privacy concerns are becoming paramount. In contrast to previous works, we take the first step towards non-tabular RL settings, while providing a rigorous privacy guarantee. In particular, we consider the adaptive control of differentially private linear quadratic (LQ) systems. We develop the first private RL algorithm, PRL, which is able to attain a sub-linear regret while guaranteeing privacy protection. More importantly, the additional cost due to privacy is only on the order of ln(1/δ)^1/4/ϵ^1/2 given privacy parameters ϵ, δ > 0. Through this process, we also provide a general procedure for adaptive control of LQ systems under changing regularizers, which not only generalizes previous non-private controls, but also serves as the basis for general private controls.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2022

Differentially Private Reinforcement Learning with Linear Function Approximation

Motivated by the wide adoption of reinforcement learning (RL) in real-wo...
research
12/09/2022

Near-Optimal Differentially Private Reinforcement Learning

Motivated by personalized healthcare and other applications involving se...
research
02/02/2022

Improved Regret for Differentially Private Exploration in Linear MDP

We study privacy-preserving exploration in sequential decision-making fo...
research
10/15/2020

Local Differentially Private Regret Minimization in Reinforcement Learning

Reinforcement learning algorithms are widely used in domains where it is...
research
12/20/2021

Differentially Private Regret Minimization in Episodic Markov Decision Processes

We study regret minimization in finite horizon tabular Markov decision p...
research
04/30/2018

Learning Optimal Reserve Price against Non-myopic Bidders

We consider the problem of learning optimal reserve price in repeated au...
research
06/24/2018

On The Differential Privacy of Thompson Sampling With Gaussian Prior

We show that Thompson Sampling with Gaussian Prior as detailed by Algori...

Please sign up or login with your details

Forgot password? Click here to reset