Constructing a Knowledge Graph from Unstructured Documents without External Alignment

by   Seunghak Yu, et al.

Knowledge graphs (KGs) are relevant to many NLP tasks, but building a reliable domain-specific KG is time-consuming and expensive. A number of methods for constructing KGs with minimized human intervention have been proposed, but still require a process to align into the human-annotated knowledge base. To overcome this issue, we propose a novel method to automatically construct a KG from unstructured documents that does not require external alignment and explore its use to extract desired information. To summarize our approach, we first extract knowledge tuples in their surface form from unstructured documents, encode them using a pre-trained language model, and link the surface-entities via the encoding to form the graph structure. We perform experiments with benchmark datasets such as WikiMovies and MetaQA. The experimental results show that our method can successfully create and search a KG with 18K documents and achieve 69.7 query retrieval task.



There are no comments yet.


page 1

page 2

page 3

page 4


BERT-based knowledge extraction method of unstructured domain text

With the development and business adoption of knowledge graph, there is ...

Exploiting Structured Knowledge in Text via Graph-Guided Representation Learning

In this work, we aim at equipping pre-trained language models with struc...

Open-domain Dialogue Generation Grounded with Dynamic Multi-form Knowledge Fusion

Open-domain multi-turn conversations normally face the challenges of how...

Enhanced Temporal Knowledge Embeddings with Contextualized Language Representations

With the emerging research effort to integrate structured and unstructur...

Bootstrapping Text Anonymization Models with Distant Supervision

We propose a novel method to bootstrap text anonymization models based o...

Hierarchical Neural Network for Extracting Knowledgeable Snippets and Documents

In this study, we focus on extracting knowledgeable snippets and annotat...

Explainable Graph-based Search for Lessons-Learned Documents in the Semiconductor Industry

Industrial processes produce a considerable volume of data and thus info...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.