A Unified Labeling Approach by Pooling Diverse Datasets for Entity Typing

10/20/2018
by   Abhishek, et al.
0

Evolution of entity typing (ET) has led to the generation of multiple datasets. These datasets span from being coarse-grained to fine-grained encompassing numerous domains. Existing works primarily focus on improving the performance of a model on an individual dataset, independently. This narrowly focused view of ET causes two issues: 1) type assignment when information about the test data domain or target label set is not available; 2) fine-grained type prediction when there is no dataset in the same domain with finer-type annotations. Our goal is to shift the focus from individual domain-specific datasets to all the datasets available for ET. In our proposed approach, we convert the label set of all datasets to a unified hierarchical label set while preserving the semantic properties of the individual labels. Then utilizing a partial label loss, we train a single neural network based classifier using every available dataset for the ET task. We empirically evaluate the effectiveness of our approach on seven real-world diverse ET datasets. The results convey that the combined training on multiple datasets helps the model to generalize better and to predict fine-types across all domains without relying on a specific domain or label set information during evaluation.

READ FULL TEXT
research
03/20/2023

DocRED-FE: A Document-Level Fine-Grained Entity And Relation Extraction Dataset

Joint entity and relation extraction (JERE) is one of the most important...
research
09/12/2019

Fine-Grained Entity Typing for Domain Independent Entity Linking

Neural entity linking models are very powerful, but run the risk of over...
research
04/25/2017

Fine-Grained Entity Typing with High-Multiplicity Assignments

As entity type systems become richer and more fine-grained, we expect th...
research
07/07/2019

Zero-Shot Open Entity Typing as Type-Compatible Grounding

The problem of entity-typing has been studied predominantly in supervise...
research
04/30/2018

Types for Information Flow Control: Labeling Granularity and Semantic Models

Language-based information flow control (IFC) tracks dependencies within...
research
09/13/2021

Fine-grained Entity Typing via Label Reasoning

Conventional entity typing approaches are based on independent classific...
research
10/12/2022

Explore Contextual Information for 3D Scene Graph Generation

3D scene graph generation (SGG) has been of high interest in computer vi...

Please sign up or login with your details

Forgot password? Click here to reset