Grandma Karl is 27 years old – research agenda for pseudonymization of research data

08/30/2023
by   Elena Volodina, et al.
0

Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the personal and sensitive information which it contains, e.g names or political opinions. General Data Protection Regulation (GDPR) suggests pseudonymization as a solution to secure open access to research data, but we need to learn more about pseudonymization as an approach before adopting it for manipulation of research data. This paper outlines a research agenda within pseudonymization, namely need of studies into the effects of pseudonymization on unstructured data in relation to e.g. readability and language assessment, as well as the effectiveness of pseudonymization as a way of protecting writer identity, while also exploring different ways of developing context-sensitive algorithms for detection, labelling and replacement of personal information in unstructured data. The recently granted project on pseudonymization Grandma Karl is 27 years old addresses exactly those challenges.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/21/2018

Personal research information system. About developing the methods for searching patent analogs of invention

The article describes information model and the method for searching pat...
research
06/01/2018

EU General Data Protection Regulation: A Gentle Introduction

The GDPR, or the Datenschutz Grundverordnung (DSGVO) in German, is an EU...
research
08/12/2022

Is Your Model Sensitive? SPeDaC: A New Benchmark for Detecting and Classifying Sensitive Personal Data

In recent years we have seen the exponential growth of applications, inc...
research
08/08/2021

Exploring the Personal Informatics Analysis Gap: "There's a Lot of Bacon"

Personal informatics research helps people track personal data for the p...
research
10/25/2012

A Biomimetic Approach Based on Immune Systems for Classification of Unstructured Data

In this paper we present the results of unstructured data clustering in ...
research
02/26/2019

An Abstract View on the De-anonymization Process

Over the recent years, the availability of datasets containing personal,...
research
05/25/2022

Are Large Pre-Trained Language Models Leaking Your Personal Information?

Large Pre-Trained Language Models (PLMs) have facilitated and dominated ...

Please sign up or login with your details

Forgot password? Click here to reset