BERT-based knowledge extraction method of unstructured domain text

03/01/2021
by   Wang Zijia, et al.
0

With the development and business adoption of knowledge graph, there is an increasing demand for extracting entities and relations of knowledge graphs from unstructured domain documents. This makes the automatic knowledge extraction for domain text quite meaningful. This paper proposes a knowledge extraction method based on BERT, which is used to extract knowledge points from unstructured specific domain texts (such as insurance clauses in the insurance industry) automatically to save manpower of knowledge graph construction. Different from the commonly used methods which are based on rules, templates or entity extraction models, this paper converts the domain knowledge points into question and answer pairs and uses the text around the answer in documents as the context. The method adopts a BERT-based model similar to BERT's SQuAD reading comprehension task. The model is fine-tuned. And it is used to directly extract knowledge points from more insurance clauses. According to the test results, the model performance is good.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/12/2022

Step out of KG: Knowledge Graph Completion via Knowledgeable Retrieval and Reading Comprehension

Knowledge graphs, as the cornerstone of many AI applications, usually fa...
research
04/09/2021

KI-BERT: Infusing Knowledge Context for Better Language and Domain Understanding

Contextualized entity representations learned by state-of-the-art deep l...
research
05/16/2023

Growing and Serving Large Open-domain Knowledge Graphs

Applications of large open-domain knowledge graphs (KGs) to real-world p...
research
08/20/2020

Constructing a Knowledge Graph from Unstructured Documents without External Alignment

Knowledge graphs (KGs) are relevant to many NLP tasks, but building a re...
research
02/04/2022

Extracting Software Requirements from Unstructured Documents

Requirements identification in textual documents or extraction is a tedi...
research
11/06/2018

Parser Extraction of Triples in Unstructured Text

The web contains vast repositories of unstructured text. We investigate ...
research
05/16/2023

Constructing and Interpreting Causal Knowledge Graphs from News

Many jobs rely on news to learn about causal events in the past and pres...

Please sign up or login with your details

Forgot password? Click here to reset