Generating synthetic transactional profiles

10/28/2021
by Hadrien Lautraite, et al.

Financial institutions use clients' payment transactions in numerous banking applications. Transactions are highly personal and rich in behavioural patterns, often unique to individuals, which makes them equivalent to personally identifiable information in some cases. In this paper, we generate synthetic transactional profiles using machine learning techniques, with the goal of preserving both data utility and privacy. One challenge was handling sparse vectors, since a client uses only a few of the available spending categories. We measured data utility by computing insights commonly used by the banking industry on both the original and the synthetic datasets. Our approach shows that neural network models can generate valuable synthetic data in this context. Finally, we tried privacy-preserving techniques and observed their effect on the models' performance.
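To make the sparsity issue concrete, the following is a minimal sketch of how a transactional profile might be represented as a fixed-length vector over spending categories, together with one simple usage-rate insight that could be compared between original and synthetic profiles. The category list, the function names, and the choice of insight are illustrative assumptions; the abstract does not specify the authors' actual representation or metrics.

```python
from collections import Counter

# Hypothetical category list; a real banking taxonomy would be much larger.
CATEGORIES = ["groceries", "restaurants", "fuel", "travel",
              "pharmacy", "utilities", "clothing", "electronics"]

def build_profile(transactions, categories=CATEGORIES):
    """Aggregate (category, amount) pairs into a fixed-length spending vector."""
    totals = Counter()
    for category, amount in transactions:
        totals[category] += amount
    return [totals.get(c, 0.0) for c in categories]

def sparsity(profile):
    """Fraction of categories in which the client spent nothing."""
    return sum(1 for v in profile if v == 0) / len(profile)

def category_usage_rate(profiles):
    """Share of clients with non-zero spending in each category."""
    n = len(profiles)
    width = len(profiles[0])
    return [sum(1 for p in profiles if p[i] > 0) / n for i in range(width)]

def max_usage_gap(original, synthetic):
    """Largest per-category difference in usage rate between two data sets."""
    rates_o = category_usage_rate(original)
    rates_s = category_usage_rate(synthetic)
    return max(abs(a - b) for a, b in zip(rates_o, rates_s))

# Illustrative client with spending in only two of eight categories.
client = [("groceries", 54.2), ("groceries", 20.0), ("fuel", 40.0)]
profile = build_profile(client)
```

With only two of the eight assumed categories used, most entries of `profile` are zero, which is the kind of sparse vector the abstract refers to. A small value of `max_usage_gap` between original and synthetic profiles would suggest, under this assumed metric, that the synthetic data preserves that particular insight.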


