TIDF-DLPM: Term and Inverse Document Frequency based Data Leakage Prevention Model

03/10/2022
by   Ishu Gupta, et al.
0

Confidentiality of the data is being endangered as it has been categorized into false categories which might get leaked to an unauthorized party. For this reason, various organizations are mainly implementing data leakage prevention systems (DLPs). Firewalls and intrusion detection systems are being outdated versions of security mechanisms. The data which are being used, in sending state or are rest are being monitored by DLPs. The confidential data is prevented with the help of neighboring contexts and contents of DLPs. In this paper, a semantic-based approach is used to classify data based on the statistical data leakage prevention model. To detect involved private data, statistical analysis is being used to contribute secure mechanisms in the environment of data leakage. The favored Frequency-Inverse Document Frequency (TF-IDF) is the facts and details recapture function to arrange documents under particular topics. The results showcase that a similar statistical DLP approach could appropriately classify documents in case of extent alteration as well as interchanged documents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/06/2022

Evaluation of Partition-Based Text Clustering Techniques to Categorize Indic Language Documents

Wide availability of electronic data has led to the vast interest in tex...
research
09/10/2018

Is Leakage Power a Linear Function of Temperature?

In this work, we present a study of the leakage power modeling technique...
research
09/07/2022

Data Leakage in Notebooks: Static Detection and Better Processes

Data science pipelines to train and evaluate models with machine learnin...
research
11/29/2022

Abstract Interpretation-Based Data Leakage Static Analysis

Data leakage is a well-known problem in machine learning. Data leakage o...
research
06/09/2023

McFIL: Model Counting Functionality-Inherent Leakage

Protecting the confidentiality of private data and using it for useful c...
research
04/17/2021

SoK: Design Tools for Side-Channel-Aware Implementions

Side-channel attacks that leak sensitive information through a computing...
research
02/08/2015

Improving Term Frequency Normalization for Multi-topical Documents, and Application to Language Modeling Approaches

Term frequency normalization is a serious issue since lengths of documen...

Please sign up or login with your details

Forgot password? Click here to reset