dpUGC: Learn Differentially Private Representation for User Generated Contents

03/25/2019
by Xuan-Son Vu, et al.

This paper first proposes a simple yet efficient generalized approach to applying differential privacy to text representations (i.e., word embeddings). Building on it, we propose a user-level approach to learning a personalized differentially private word embedding model on user-generated content (UGC). To the best of our knowledge, this is the first work on learning a user-level differentially private word embedding model from text for sharing. The proposed approaches protect individuals from re-identification and, in particular, provide a better trade-off between privacy and data utility when UGC data is shared. The experimental results show that the trained embedding models are applicable to classic text analysis tasks (e.g., regression). Moreover, the proposed approaches to learning differentially private embedding models are both framework- and data-independent, which facilitates deployment and sharing. The source code is available at https://github.com/sonvx/dpText.
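
The abstract does not spell out the privacy mechanism. As a rough illustration of how differential privacy is commonly applied when training an embedding, the sketch below shows one noisy gradient step in the style of differentially private SGD: each per-example gradient is clipped to a fixed L2 norm, the clipped gradients are summed, and Gaussian noise scaled to the clipping norm is added before the update. This is an assumption made for illustration, not the authors' implementation from the dpText repository; the function names and the parameters `clip_norm` and `noise_multiplier` are hypothetical. A user-level variant would presumably clip each user's total contribution rather than each example's.

```python
import math
import random


def clip_by_l2_norm(vec, max_norm):
    """Return a copy of vec scaled down so its L2 norm is at most max_norm."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm <= max_norm or norm == 0.0:
        return list(vec)
    scale = max_norm / norm
    return [x * scale for x in vec]


def dp_sgd_step(embedding, per_example_grads, lr, clip_norm, noise_multiplier, rng):
    """Apply one noisy gradient step to a single embedding vector.

    Hypothetical helper for illustration only:
    1. clip every per-example gradient to L2 norm <= clip_norm,
    2. sum the clipped gradients,
    3. add Gaussian noise with std = noise_multiplier * clip_norm,
    4. average over the batch and take a gradient descent step.
    """
    dim = len(embedding)
    clipped = [clip_by_l2_norm(g, clip_norm) for g in per_example_grads]
    summed = [sum(g[i] for g in clipped) for i in range(dim)]
    sigma = noise_multiplier * clip_norm
    noisy = [s + rng.gauss(0.0, sigma) for s in summed]
    batch_size = len(per_example_grads)
    return [w - lr * n / batch_size for w, n in zip(embedding, noisy)]


# Example: two per-example gradients for one 3-dimensional word vector.
rng = random.Random(0)  # fixed seed so the sketch is reproducible
embedding = [0.1, -0.2, 0.3]
grads = [[3.0, 4.0, 0.0], [0.1, 0.0, 0.0]]
updated = dp_sgd_step(embedding, grads, lr=0.05, clip_norm=1.0,
                      noise_multiplier=1.1, rng=rng)
```

A larger `noise_multiplier` gives stronger privacy at the cost of noisier embeddings, which is the privacy/utility trade-off the abstract refers to. Turning the noise level into a concrete privacy budget requires a separate privacy accountant, which this sketch omits.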


research
06/24/2023

Differentially Private Decentralized Deep Learning with Consensus Algorithms

Cooperative decentralized deep learning relies on direct information exc...
research
05/02/2020

Differentially Private Generation of Small Images

We explore the training of generative adversarial networks with differen...
research
02/22/2021

Differentially Private Supervised Manifold Learning with Applications like Private Image Retrieval

Differential Privacy offers strong guarantees such as immutable privacy ...
research
02/23/2022

Differentially Private Speaker Anonymization

Sharing real-world speech utterances is key to the training and deployme...
research
08/07/2023

Randomized algorithms for precise measurement of differentially-private, personalized recommendations

Personalized recommendations form an important part of today's internet ...
research
10/18/2020

Enabling Fast Differentially Private SGD via Just-in-Time Compilation and Vectorization

A common pain point in differentially private machine learning is the si...
research
10/28/2019

Improved Differentially Private Decentralized Source Separation for fMRI Data

Blind source separation algorithms such as independent component analysi...
