Data Minimization for GDPR Compliance in Machine Learning Models

by   Abigail Goldsteen, et al.

The EU General Data Protection Regulation (GDPR) mandates the principle of data minimization, which requires that only data necessary to fulfill a certain purpose be collected. However, it can often be difficult to determine the minimal amount of data required, especially in complex machine learning models such as neural networks. We present a first-of-a-kind method to reduce the amount of personal data needed to perform predictions with a machine learning model, by removing or generalizing some of the input features. Our method makes use of the knowledge encoded within the model to produce a generalization that has little to no impact on its accuracy. This enables the creators and users of machine learning models to acheive data minimization, in a provable manner.



There are no comments yet.


page 1

page 2

page 3

page 4


ML Privacy Meter: Aiding Regulatory Compliance by Quantifying the Privacy Risks of Machine Learning

When building machine learning models using sensitive data, organization...

Algebraic Machine Learning

Machine learning algorithms use error function minimization to fit a lar...

Anonymizing Machine Learning Models

There is a known tension between the need to analyze personal data to dr...

QuantifyML: How Good is my Machine Learning Model?

The efficacy of machine learning models is typically determined by compu...

Operationalizing the Legal Principle of Data Minimization for Personalization

Article 5(1)(c) of the European Union's General Data Protection Regulati...

A Note on High-Probability versus In-Expectation Guarantees of Generalization Bounds in Machine Learning

Statistical machine learning theory often tries to give generalization g...

A Reflection on Learning from Data: Epistemology Issues and Limitations

Although learning from data is effective and has achieved significant mi...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.