Fine Grained Classification of Personal Data Entities

11/23/2018
by   Riddhiman Dasgupta, et al.
0

Entity Type Classification can be defined as the task of assigning category labels to entity mentions in documents. While neural networks have recently improved the classification of general entity mentions, pattern matching and other systems continue to be used for classifying personal data entities (e.g. classifying an organization as a media company or a government institution for GDPR, and HIPAA compliance). We propose a neural model to expand the class of personal data entities that can be classified at a fine grained level, using the output of existing pattern matching systems as additional contextual features. We introduce new resources, a personal data entities hierarchy with 134 types, and two datasets from the Wikipedia pages of elected representatives and Enron emails. We hope these resource will aid research in the area of personal data discovery, and to that effect, we provide baseline results on these datasets, and compare our method with state of the art models on OntoNotes dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/03/2014

Context-Dependent Fine-Grained Entity Type Tagging

Entity type tagging is the task of assigning category labels to each men...
research
04/25/2017

Fine-Grained Entity Typing with High-Multiplicity Assignments

As entity type systems become richer and more fine-grained, we expect th...
research
04/19/2016

An Attentive Neural Architecture for Fine-grained Entity Type Classification

In this work we propose a novel attention-based neural network model for...
research
11/15/2017

Finer Grained Entity Typing with TypeNet

We consider the challenging problem of entity typing over an extremely f...
research
01/21/2020

Classifying Wikipedia in a fine-grained hierarchy: what graphs can contribute

Wikipedia is a huge opportunity for machine learning, being the largest ...
research
05/22/2023

EnCore: Pre-Training Entity Encoders using Coreference Chains

Entity typing is the task of assigning semantic types to the entities th...
research
08/06/2020

Fine-Grained Complexity of Regular Expression Pattern Matching and Membership

The currently fastest algorithm for regular expression pattern matching ...

Please sign up or login with your details

Forgot password? Click here to reset