Privacy-Preserving Synthetic Educational Data Generation

07/07/2022
by   Jill-Jênn Vie, et al.
0

Institutions collect massive learning traces but they may not disclose it for privacy issues. Synthetic data generation opens new opportunities for research in education. In this paper we present a generative model for educational data that can preserve the privacy of participants, and an evaluation framework for comparing synthetic data generators. We show how naive pseudonymization can lead to re-identification threats and suggest techniques to guarantee privacy. We evaluate our method on existing massive educational open datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2020

Synthetic Data – A Privacy Mirage

Synthetic datasets drawn from generative models have been advertised as ...
research
03/11/2022

FedSyn: Synthetic Data Generation using Federated Learning

As Deep Learning algorithms continue to evolve and become more sophistic...
research
09/27/2022

Privacy-Preserving Synthetic Data Generation for Recommendation Systems

Recommendation systems make predictions chiefly based on users' historic...
research
03/02/2023

GlucoSynth: Generating Differentially-Private Synthetic Glucose Traces

In this paper we focus on the problem of generating high-quality, privat...
research
04/03/2023

Coincidental Generation

Generative A.I. models have emerged as versatile tools across diverse in...
research
10/28/2021

Generating synthetic transactional profiles

Financial institutions use clients' payment transactions in numerous ban...
research
05/08/2019

Reconstruction of Privacy-Sensitive Data from Protected Templates

In this paper, we address the problem of data reconstruction from privac...

Please sign up or login with your details

Forgot password? Click here to reset