S3BD: Secure Semantic Search over Encrypted Big Data in the Cloud

09/21/2018
by Jason Woodworth, et al.

Cloud storage is a widely used service for both personal and enterprise needs. Despite its advantages, however, many potential users with enormous amounts of sensitive data (big data) refrain from fully utilizing cloud storage because of valid concerns about data privacy. An established solution to the cloud data privacy problem is to encrypt the data on the client side. This approach, however, restricts data processing capabilities (e.g., searching over the data). Accordingly, the research problem we investigate is how to enable real-time search over encrypted big data in the cloud. In particular, semantic search is of interest to clients dealing with big data. To address this problem, we develop a system (termed S3BD) for searching big data using cloud services without exposing any data to cloud providers. To maintain real-time response on big data, S3BD proactively prunes the search space to a subset of the whole dataset. For that purpose, we propose a method to cluster the encrypted data. An abstract of each cluster is maintained on the client side and is used at search time to direct the search operation to the appropriate clusters. Results of experiments carried out on real-world big datasets demonstrate that the search operation can be performed in real time and is significantly more efficient than its counterparts. In addition, a fully functional prototype of S3BD is made publicly available.
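
The pruning idea described in the abstract can be illustrated with a small Python sketch. This is not the S3BD implementation: the abstract does not specify the token scheme, the clustering method, or the ranking, so everything here is an assumption made for illustration. Terms are replaced by keyed HMAC-SHA256 tokens, the cluster assignment is given by hand, each cluster abstract is simply the set of tokens in that cluster, and the hypothetical helpers `token`, `select_clusters`, and `search_clusters` stand in for the client and server roles. Semantic query expansion is left out, and document ids are kept in the clear for brevity.

```python
import hashlib
import hmac


def token(key: bytes, term: str) -> str:
    """Keyed hash of a term, so the server only sees opaque tokens (assumed scheme)."""
    return hmac.new(key, term.lower().encode("utf-8"), hashlib.sha256).hexdigest()


def select_clusters(abstracts, query_tokens, top_k=1):
    """Client side: rank clusters by token overlap between the query and each abstract."""
    scored = [(len(abstract & query_tokens), cid) for cid, abstract in abstracts.items()]
    scored = [(score, cid) for score, cid in scored if score > 0]
    scored.sort(key=lambda pair: (-pair[0], pair[1]))
    return [cid for _, cid in scored[:top_k]]


def search_clusters(server_index, cluster_ids, query_tokens):
    """Server side: look up query tokens only inside the selected clusters."""
    hits = set()
    for cid in cluster_ids:
        postings = server_index.get(cid, {})
        for t in query_tokens:
            hits |= postings.get(t, set())
    return hits


# Toy data: the cluster assignment is hard-coded here purely for illustration.
KEY = b"demo-key-not-for-production"
plain_clusters = {
    "c0": {"doc1": ["cloud", "storage"], "doc2": ["cloud", "privacy"]},
    "c1": {"doc3": ["genome", "index"]},
}

server_index = {}  # what the cloud would hold: cluster -> token -> document ids
abstracts = {}     # what the client would hold: cluster -> set of tokens
for cid, docs in plain_clusters.items():
    server_index[cid] = {}
    abstracts[cid] = set()
    for doc_id, terms in docs.items():
        for term in terms:
            t = token(KEY, term)
            server_index[cid].setdefault(t, set()).add(doc_id)
            abstracts[cid].add(t)

query = {token(KEY, word) for word in ["cloud", "privacy"]}
chosen = select_clusters(abstracts, query, top_k=1)
results = search_clusters(server_index, chosen, query)
```

In this toy setup only the cluster chosen by the client is scanned on the server, which is the sense in which the search space is pruned; how S3BD actually builds and ranks its cluster abstracts is described in the full paper rather than here.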

research
08/10/2019

Edge Computing for User-Centric Secure Search on Cloud-Based Encrypted Big Data

Cloud service providers offer a low-cost and convenient solution to host...
research
05/22/2020

Privacy-Preserving Clustering of Unstructured Big Data for Cloud-Based Enterprise Search Solutions

Cloud-based enterprise search services (e.g., Amazon Kendra) are enchant...
research
08/14/2019

ClustCrypt: Privacy-Preserving Clustering of Unstructured Big Data in the Cloud

Security and confidentiality of big data stored in the cloud are importa...
research
02/26/2021

SAED: Edge-Based Intelligence for Privacy-Preserving Enterprise Search on the Cloud

Cloud-based enterprise search services (e.g., AWS Kendra) have been entr...
research
12/19/2019

Is Big Data Performance Reproducible in Modern Cloud Networks?

Performance variability has been acknowledged as a problem for over a de...
research
03/06/2020

A Hierarchical Semantic Overlay for P2P Search

In this paper, we propose a hierarchical semantic overlay network for se...
research
07/20/2020

A Big Data Approach for Sequences Indexing on the Cloud via Burrows Wheeler Transform

Indexing sequence data is important in the context of Precision Medicine...
