Understood in Translation, Transformers for Domain Understanding

Knowledge acquisition is the essential first step of any Knowledge Graph (KG) application. This knowledge can be extracted from a given corpus (KG generation process) or specified from an existing KG (KG specification process). Focusing on domain specific solutions, knowledge acquisition is a labor intensive task usually orchestrated and supervised by subject matter experts. Specifically, the domain of interest is usually manually defined and then the needed generation or extraction tools are utilized to produce the KG. Herein, we propose a supervised machine learning method, based on Transformers, for domain definition of a corpus. We argue why such automated definition of the domain's structure is beneficial both in terms of construction time and quality of the generated graph. The proposed method is extensively validated on three public datasets (WebNLG, NYT and DocRED) by comparing it with two reference methods based on CNNs and RNNs models. The evaluation shows the efficiency of our model in this task. Focusing on scientific document understanding, we present a new health domain dataset based on publications extracted from PubMed and we successfully utilize our method on this. Lastly, we demonstrate how this work lays the foundation for fully automated and unsupervised KG generation.

READ FULL TEXT
research
11/14/2021

CDM: Combining Extraction and Generation for Definition Modeling

Definitions are essential for term understanding. Recently, there is an ...
research
10/26/2020

Acquiring domain models

Whereas a Learning Apprentice System stresses the generation and refinem...
research
01/16/2013

A Knowledge Acquisition Tool for Bayesian-Network Troubleshooters

This paper describes a domain-specific knowledge acquisition tool for in...
research
04/18/2021

Knowledge Graph Anchored Information-Extraction for Domain-Specific Insights

The growing quantity and complexity of data pose challenges for humans t...
research
11/24/2020

Tackling Domain-Specific Winograd Schemas with Knowledge-Based Reasoning and Machine Learning

The Winograd Schema Challenge (WSC) is a common-sense reasoning task tha...
research
08/12/2019

Assessing the Quality of Scientific Papers

A multitude of factors are responsible for the overall quality of scient...
research
08/08/2023

Adapting Foundation Models for Information Synthesis of Wireless Communication Specifications

Existing approaches to understanding, developing and researching modern ...

Please sign up or login with your details

Forgot password? Click here to reset