Privacy Risks from Public Data Sources

11/25/2017
by   Zacharias Tzermias, et al.
0

In the fight against tax evaders and other cheats, governments seek to gather more information about their citizens. In this paper we claim that this increased transparency, combined with ineptitude, or corruption, can lead to widespread violations of privacy, ultimately harming law-abiding individuals while helping those engaged in criminal activities such as stalking, identity theft and so on. In this paper we survey a number of data sources administrated by the Greek state, offered as web services, to investigate whether they can lead to leakage of sensitive information. Our study shows that we were able to download significant portions of the data stored in some of these data sources (scraping). Moreover, for those data sources that were not amenable to scraping we looked at ways of extracting information for specific individuals that we had identified by looking at other data sources. The vulnerabilities we have discovered enable the collection of personal data and, thus, open the way for a variety of impersonation attacks, identity theft, confidence trickster attacks and so on. We believe that the lack of a big picture which was caused by the piecemeal development of these data sources hides the true extent of the threat. Hence, by looking at all these data sources together, we outline a number of mitigation strategies that can alleviate some of the most obvious attack strategies. Finally, we look at measures that can be taken in the longer term to safeguard the privacy of the citizens.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/20/2020

A Model-based Chatbot Generation Approach to Converse with Open Data Sources

The Open Data movement promotes the free distribution of data. More and ...
research
01/16/2020

A Common Operating Picture Framework Leveraging Data Fusion and Deep Learning

Organizations are starting to realize of the combined power of data and ...
research
10/05/2021

Data Validation for Big Live Data

Data Integration of heterogeneous data sources relies either on periodic...
research
05/07/2022

Airport Digital Twins for Resilient Disaster Management Response

Airports are constantly facing a variety of hazards and threats from nat...
research
05/22/2020

OBDA for the Web: Creating Virtual RDF Graphs On Top of Web Data Sources

Due to Variety, Web data come in many different structures and formats, ...
research
09/22/2022

Linking Contexts from Distinct Data Sources in Zero Trust Federation

An access control model called Zero Trust Architecture (ZTA) has attract...
research
05/18/2023

Towards the Automatic Generation of Conversational Interfaces to Facilitate the Exploration of Tabular Data

Tabular data is the most common format to publish and exchange structure...

Please sign up or login with your details

Forgot password? Click here to reset