Privacy preserving local analysis of digital trace data: A proof-of-concept

10/11/2021
by   Laura Boeschoten, et al.
0

We present PORT, a software platform for local data extraction and analysis of digital trace data. While digital trace data collected by private and public parties hold a huge potential for social-scientific discovery, their most useful parts have been unattainable for academic researchers due to privacy concerns and prohibitive API access. However, the EU General Data Protection Regulation (GDPR) grants all citizens the right to an electronic copy of their personal data. All major data controllers, such as social media platforms, banks, online shops, loyalty card systems and public transportation cards comply with this right by providing their clients with a `Data Download Package' (DDP). Previously, a conceptual workflow was introduced allowing citizens to donate their data to scientific- researchers. In this workflow, citizens' DDPs are processed locally on their machines before they are asked to provide informed consent to share a subset of the processed data with the researchers. In this paper, we present the newly developed software PORT that implements the local processing part of this workflow, protecting privacy by shielding sensitive data from any contact with outside observers – including the researchers themselves. Thus, PORT enables a host of potential applications of social data science to hitherto unobtainable data.

READ FULL TEXT

page 3

page 5

page 7

page 8

page 10

research
05/04/2021

Automatic de-identification of Data Download Packages

The General Data Protection Regulation (GDPR) grants all natural persons...
research
11/13/2020

Digital trace data collection through data donation

A potentially powerful method of social-scientific data collection and i...
research
01/16/2018

MORF: A Framework for MOOC Predictive Modeling and Replication At Scale

The MOOC Replication Framework (MORF) is a novel software system for fea...
research
02/23/2023

Don't Look at the Data! How Differential Privacy Reconfigures the Practices of Data Science

Across academia, government, and industry, data stewards are facing incr...
research
02/03/2022

Privacy-Aware Crowd Labelling for Machine Learning Tasks

The extensive use of online social media has highlighted the importance ...
research
10/19/2020

Private-Yet-Verifiable Contact Tracing

We propose PrYVeCT, a private-yet-verifiable contact tracing system. PrY...
research
04/20/2022

Exploring Widevine for Fun and Profit

For years, Digital Right Management (DRM) systems have been used as the ...

Please sign up or login with your details

Forgot password? Click here to reset