A Frequent Itemset Hiding Toolbox

02/28/2018
by   Vasileios Kagklis, et al.

Advances in data collection and storage technologies have given rise to transactional databases across companies and organizations, since they allow enormous amounts of data to be stored efficiently. Useful knowledge can be mined from these data and used in several ways, depending on the nature of the data. Companies and organizations are often willing to share data for mutual benefit. However, sharing such data carries risks, as privacy problems may arise. Sensitive data, along with sensitive knowledge inferred from these data, must be protected from unintentional exposure to unauthorized parties. One form of inferred knowledge is frequent patterns, mined as frequent itemsets from transactional databases. The problem of protecting such patterns is known as the frequent itemset hiding problem. In this paper we present a toolbox that provides several implementations of frequent itemset hiding algorithms. First, we summarize the most important aspects of each algorithm. We then introduce the architecture of the toolbox and its novel features. Finally, we provide experimental results on real-world datasets, demonstrating the efficiency of the toolbox and the convenience it offers in comparing different algorithms.
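
To make the problem statement concrete, the following is a minimal sketch of what hiding a frequent itemset can mean on a toy database. It is not one of the toolbox's algorithms, which the abstract does not describe. The function names `support` and `hide_itemset`, the toy transactions, and the naive strategy of deleting one item from supporting transactions are all assumptions made for illustration.

```python
def support(transactions, itemset):
    """Fraction of transactions that contain every item of `itemset`."""
    itemset = frozenset(itemset)
    return sum(itemset <= t for t in transactions) / len(transactions)


def hide_itemset(transactions, sensitive, min_support):
    """Naive, hypothetical sanitization: remove one item of `sensitive`
    from supporting transactions until its support drops below
    `min_support`. Real hiding algorithms choose modifications far more
    carefully to limit side effects on non-sensitive itemsets."""
    sensitive = frozenset(sensitive)
    victim = min(sensitive)  # arbitrary but deterministic choice of item
    sanitized = [set(t) for t in transactions]  # leave the input untouched
    for t in sanitized:
        if support(sanitized, sensitive) < min_support:
            break  # the itemset is no longer frequent, so it is hidden
        if sensitive <= t:
            t.discard(victim)
    return sanitized


# Toy database (assumed data): each transaction is a set of items.
transactions = [
    {"bread", "milk"},
    {"bread", "milk", "eggs"},
    {"milk", "eggs"},
    {"bread", "milk"},
]
sanitized = hide_itemset(transactions, {"bread", "milk"}, min_support=0.5)
print(support(transactions, {"bread", "milk"}))  # support before hiding
print(support(sanitized, {"bread", "milk"}))     # support after hiding
```

In this sketch an itemset counts as frequent when its support is at least `min_support`, so hiding it means pushing its support strictly below that threshold while changing as little of the database as possible.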

research
03/20/2019

Extracting Frequent Gradual Patterns Using Constraints Modeling

In this paper, we propose a constraint-based modeling approach for the p...
research
05/12/2021

Frequent Pattern Mining in Continuous-time Temporal Networks

Networks are used as highly expressive tools in different disciplines. I...
research
06/26/2017

Private Data System Enabling Self-Sovereign Storage Managed by Executable Choreographies

With the increased use of Internet, governments and large companies stor...
research
12/19/2019

Fast Mining of Spatial Frequent Wordset from Social Database

In this paper, we propose an algorithm that extracts spatial frequent pa...
research
01/29/2021

Finding the Sweet Spot for Data Anonymization: A Mechanism Design Perspective

Data sharing between different organizations is an essential process in ...
research
12/10/2020

A novel algorithm for clearing financial obligations between companies – an application within the Romanian Ministry of Economy

The concept of clearing or netting, as defined in the glossaries of Euro...
