Privacy-Preserving Clustering of Unstructured Big Data for Cloud-Based Enterprise Search Solutions

05/22/2020
by   SM Zobaed, et al.
0

Cloud-based enterprise search services (e.g., Amazon Kendra) are enchanting to big data owners by providing them with convenient search solutions over their enterprise big datasets. However, individuals and businesses that deal with confidential big data (eg, credential documents) are reluctant to fully embrace such services, due to valid concerns about data privacy. Solutions based on client-side encryption have been explored to mitigate privacy concerns. Nonetheless, such solutions hinder data processing, specifically clustering, which is pivotal in dealing with different forms of big data. For instance, clustering is critical to limit the search space and perform real-time search operations on big datasets. To overcome the hindrance in clustering encrypted big data, we propose privacy-preserving clustering schemes for three forms of unstructured encrypted big datasets, namely static, semi-dynamic, and dynamic datasets. To preserve data privacy, the proposed clustering schemes function based on statistical characteristics of the data and determine (A) the suitable number of clusters and (B) appropriate content for each cluster. Experimental results obtained from evaluating the clustering schemes on three different datasets demonstrate between 30 on the clusters' coherency compared to other clustering schemes for encrypted data. Employing the clustering schemes in a privacy-preserving enterprise search system decreases its search time by up to 78 search accuracy by up to 35

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/14/2019

ClustCrypt: Privacy-Preserving Clustering of Unstructured Big Data in the Cloud

Security and confidentiality of big data stored in the cloud are importa...
research
02/26/2021

SAED: Edge-Based Intelligence for Privacy-Preserving Enterprise Search on the Cloud

Cloud-based enterprise search services (e.g., AWS Kendra) have been entr...
research
09/21/2018

S3BD: Secure Semantic Search over Encrypted Big Data in the Cloud

Cloud storage is a widely utilized service for both personal and enterpr...
research
06/27/2019

Distributed Clustering in the Anonymized Space with Local Differential Privacy

Clustering and analyzing on collected data can improve user experiences ...
research
08/10/2019

Edge Computing for User-Centric Secure Search on Cloud-Based Encrypted Big Data

Cloud service providers offer a low-cost and convenient solution to host...
research
05/11/2019

GraphSE^2: An Encrypted Graph Database for Privacy-Preserving Social Search

In this paper, we propose GraphSE^2, an encrypted graph database for onl...
research
11/18/2022

IEEE Big Data Cup 2022: Privacy Preserving Matching of Encrypted Images with Deep Learning

Smart sensors, devices and systems deployed in smart cities have brought...

Please sign up or login with your details

Forgot password? Click here to reset