KGClean: An Embedding Powered Knowledge Graph Cleaning Framework

04/26/2020
by   Congcong Ge, et al.
0

The quality assurance of the knowledge graph is a prerequisite for various knowledge-driven applications. We propose KGClean, a novel cleaning framework powered by knowledge graph embedding, to detect and repair the heterogeneous dirty data. In contrast to previous approaches that either focus on filling missing data or clean errors violated limited rules, KGClean enables (i) cleaning both missing data and other erroneous values, and (ii) mining potential rules automatically, which expands the coverage of error detecting. KGClean first learns data representations by TransGAT, an effective knowledge graph embedding model, which gathers the neighborhood information of each data and incorporates the interactions among data for casting data to continuous vector spaces with rich semantics. KGClean integrates an active learning-based classification model, which identifies errors with a small seed of labels. KGClean utilizes an efficient PRO-repair strategy to repair errors using a novel concept of propagation power. Extensive experiments on four typical knowledge graphs demonstrate the effectiveness of KGClean in practice.

READ FULL TEXT

page 3

page 4

page 6

page 7

page 8

page 9

page 13

page 14

research
11/20/2019

Joint Embedding Learning of Educational Knowledge Graphs

As an efficient model for knowledge organization, the knowledge graph ha...
research
02/21/2022

Dynamic Relation Repairing for Knowledge Enhancement

Dynamic relation repair aims to efficiently validate and repair the inst...
research
03/09/2019

Logic Rules Powered Knowledge Graph Embedding

Large scale knowledge graph embedding has attracted much attention from ...
research
03/23/2020

What is Normal, What is Strange, and What is Missing in a Knowledge Graph: Unified Characterization via Inductive Summarization

Knowledge graphs (KGs) store highly heterogeneous information about the ...
research
08/17/2022

Knowledge Graph Curation: A Practical Framework

Knowledge Graphs (KGs) have shown to be very important for applications ...
research
09/01/2020

More is not Always Better: The Negative Impact of A-box Materialization on RDF2vec Knowledge Graph Embeddings

RDF2vec is an embedding technique for representing knowledge graph entit...
research
09/25/2018

TTMF: A Triple Trustworthiness Measurement Frame for Knowledge Graphs

The Knowledge graph (KG) uses the triples to describe the facts in the r...

Please sign up or login with your details

Forgot password? Click here to reset