AI-based Re-identification of Behavioral Clickstream Data

01/21/2022
by Stefan Vamosi, et al.

AI-based face recognition, i.e., the re-identification of individuals within images, is an already well-established technology for video surveillance, user authentication, tagging photos of friends, etc. This paper demonstrates that similar techniques can be applied to successfully re-identify individuals purely based on their behavioral patterns. In contrast to de-anonymization attacks based on record linkage, these methods do not require any overlap in data points between a released dataset and an identified auxiliary dataset. The mere resemblance of behavioral patterns between records is sufficient to correctly attribute behavioral data to identified individuals. Further, we demonstrate that data perturbation does not provide protection unless a significant share of data utility is destroyed. These findings call for serious caution when sharing actual behavioral data with third parties, as modern-day privacy regulations, like the GDPR, define their scope based on the ability to re-identify. This also has strong implications for the marketing domain when dealing with potentially re-identifiable data sources like shopping behavior, clickstream data, or cookies. We also demonstrate how synthetic data can offer a viable alternative that is shown to be resilient against our introduced AI-based re-identification attacks.
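
The idea that resemblance alone can link records, without any shared data points, can be illustrated with a toy sketch. The abstract does not describe the model the authors use, so the code below is not their method: it swaps in a plain frequency profile and cosine similarity as a stand-in for a learned behavioral representation. All names (`profile`, `cosine`, `reidentify`) and the sample data are hypothetical.

```python
from collections import Counter
from math import sqrt


def profile(events):
    """Turn a list of visited page categories into relative frequencies."""
    counts = Counter(events)
    total = sum(counts.values())
    return {category: n / total for category, n in counts.items()}


def cosine(p, q):
    """Cosine similarity between two sparse frequency profiles."""
    dot = sum(value * q.get(category, 0.0) for category, value in p.items())
    norm_p = sqrt(sum(value * value for value in p.values()))
    norm_q = sqrt(sum(value * value for value in q.values()))
    if norm_p == 0.0 or norm_q == 0.0:
        return 0.0
    return dot / (norm_p * norm_q)


def reidentify(released, auxiliary):
    """Attribute each released record to the most similar known identity."""
    aux_profiles = {name: profile(events) for name, events in auxiliary.items()}
    matches = {}
    for record_id, events in released.items():
        p = profile(events)
        matches[record_id] = max(
            aux_profiles, key=lambda name: cosine(p, aux_profiles[name])
        )
    return matches


# Hypothetical data: the auxiliary and released sessions come from
# different time windows, so no individual session appears in both sets.
auxiliary = {
    "alice": ["news", "news", "sports", "news", "weather"],
    "bob": ["shop", "shop", "cart", "shop", "news"],
}
released = {
    "r1": ["shop", "cart", "shop", "shop"],
    "r2": ["news", "weather", "news", "news", "sports", "news"],
}

matches = reidentify(released, auxiliary)
print(matches)
```

In this sketch the released records carry no identifiers and share no session with the auxiliary data, yet each one is attributed to the identity whose behavioral profile it most resembles.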

research
01/24/2023

A Linear Reconstruction Approach for Attribute Inference Attacks against Synthetic Data

Personal data collected at scale from surveys or digital devices offers ...
research
12/09/2015

Where You Are Is Who You Are: User Identification by Matching Statistics

Most users of online services have unique behavioral or usage patterns. ...
research
12/15/2017

Health Data in an Open World

With the aim of informing sound policy about data sharing and privacy, w...
research
02/23/2018

Behavioral-clinical phenotyping with type 2 diabetes self-monitoring data

Objective: To evaluate unsupervised clustering methods for identifying i...
research
08/14/2019

Stop the Open Data Bus, We Want to Get Off

The subject of this report is the re-identification of individuals in th...
research
03/12/2018

idtracker.ai: Tracking all individuals in large collectives of unmarked animals

Our understanding of collective animal behavior is limited by our abilit...
research
06/24/2019

AnonTokens: tracing re-identification attacks through decoy records

Privacy is of the utmost concern when it comes to releasing data to thir...
