Towards Automatic Comparison of Data Privacy Documents: A Preliminary Experiment on GDPR-like Laws

05/21/2021
by Kornraphop Kawintiranon, et al.

The General Data Protection Regulation (GDPR) has become a standard for data protection law in many countries. Currently, twelve countries have adopted the regulation and established their own GDPR-like regulations. However, evaluating the differences and similarities among these GDPR-like regulations is time-consuming and requires substantial manual effort from legal experts. Moreover, GDPR-like regulations from different countries are written in their own languages, which makes the task harder because it requires legal experts who know both languages. In this paper, we investigate a simple natural language processing (NLP) approach to tackle the problem. We first extract chunks of information from GDPR-like documents and form structured data from natural language. Next, we use NLP methods to compare the documents and measure their similarity. Finally, we manually label a small set of data to evaluate our approach. The empirical results show that the BERT model with cosine similarity outperforms the other baselines. Our data and code are publicly available.
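
The comparison step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract reports BERT embeddings with cosine similarity, whereas this example substitutes plain bag-of-words count vectors so that it runs with the Python standard library alone. The function names (`tokenize`, `cosine_similarity`, `best_matches`) and the sample chunks are hypothetical and were invented for illustration.

```python
import math
import re
from collections import Counter


def tokenize(text):
    """Lowercase a chunk and split it into alphabetic word tokens."""
    return re.findall(r"[a-z]+", text.lower())


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # an empty chunk is treated as similar to nothing
    return dot / (norm_a * norm_b)


def best_matches(chunks_a, chunks_b):
    """For each chunk of document A, find the most similar chunk of document B."""
    vecs_b = [Counter(tokenize(c)) for c in chunks_b]
    results = []
    for i, chunk in enumerate(chunks_a):
        vec = Counter(tokenize(chunk))
        scores = [cosine_similarity(vec, vb) for vb in vecs_b]
        j = max(range(len(scores)), key=scores.__getitem__)
        results.append((i, j, scores[j]))
    return results


# Hypothetical chunks, invented for illustration (not quoted from any law).
doc_a = [
    "The data subject has the right to obtain erasure of personal data.",
    "The controller shall notify the authority of a personal data breach.",
]
doc_b = [
    "A personal data breach must be reported by the controller to the authority.",
    "The data subject may request erasure of personal data.",
]

for i, j, score in best_matches(doc_a, doc_b):
    print(f"A[{i}] best matches B[{j}] (cosine = {score:.2f})")
```

In this toy input each chunk of the first document pairs with its paraphrase in the second. Replacing the count vectors with sentence embeddings from a BERT model would bring the sketch closer to the setup the abstract describes, while the cosine computation itself would stay the same.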


