Learning from aggregated data with a maximum entropy model

10/05/2022
by Alexandre Gilotte, et al.

Aggregating a dataset, then injecting some noise, is a simple and common way to release differentially private data. However, aggregated data, even without noise, is not an appropriate input for machine learning classifiers. In this work, we show how a new model, similar to a logistic regression, can be learned from aggregated data alone by approximating the unobserved feature distribution with a maximum entropy hypothesis. The resulting model is a Markov Random Field (MRF), and we detail how to apply, modify and scale an MRF training algorithm to our setting. Finally, we present empirical evidence on several public datasets that the model learned this way can achieve performance comparable to that of a logistic model trained with the full unaggregated data.
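To make the "aggregate, then add noise" release concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the paper's method: the abstract does not specify which aggregates are released, so the per-feature (value, label) count tables, the helper name `noisy_aggregates`, and the Laplace calibration (L1 sensitivity taken as the number of tables under add/remove-one-record neighbouring) are all hypothetical choices.

```python
import numpy as np

def noisy_aggregates(X, y, n_values, epsilon, rng):
    """Hypothetical helper: per-feature (value, label) counts plus Laplace noise.

    X: int array of shape (n_samples, n_features), entries in [0, n_values).
    y: int array of shape (n_samples,), binary labels in {0, 1}.
    """
    n_samples, n_features = X.shape
    # Assumption: each record adds 1 to exactly one cell of each table, so
    # adding or removing one record changes the release by n_features in L1.
    scale = n_features / epsilon
    tables = []
    for j in range(n_features):
        counts = np.zeros((n_values, 2))
        # Unbuffered accumulation: repeated (value, label) pairs all count.
        np.add.at(counts, (X[:, j], y), 1.0)
        noise = rng.laplace(0.0, scale, size=counts.shape)
        tables.append(counts + noise)
    return tables

# Toy data: 1000 records, 4 categorical features with 3 values each.
rng = np.random.default_rng(0)
X = rng.integers(0, 3, size=(1000, 4))
y = rng.integers(0, 2, size=1000)
tables = noisy_aggregates(X, y, n_values=3, epsilon=1.0, rng=rng)
```

Each released table keeps only marginal counts, so the joint distribution of the features is no longer observed. That missing joint distribution is the gap the maximum entropy hypothesis described above is meant to fill.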


research
04/24/2023

Sparse Private LASSO Logistic Regression

LASSO regularized logistic regression is particularly useful for its bui...
research
03/02/2023

Choosing Public Datasets for Private Machine Learning via Gradient Subspace Distance

Differentially private stochastic gradient descent privatizes model trai...
research
06/24/2020

Distributionally-Robust Machine Learning Using Locally Differentially-Private Data

We consider machine learning, particularly regression, using locally-dif...
research
07/30/2014

Differentially-Private Logistic Regression for Detecting Multiple-SNP Association in GWAS Databases

Following the publication of an attack on genome-wide association studie...
research
01/14/2020

Private Machine Learning via Randomised Response

We introduce a general learning framework for private machine learning b...
research
08/08/2023

Accurate, Explainable, and Private Models: Providing Recourse While Minimizing Training Data Leakage

Machine learning models are increasingly utilized across impactful domai...
research
08/28/2020

Introduction to logistic regression

For random field theory based multiple comparison corrections In brain i...
