SiGMa: Simple Greedy Matching for Aligning Large Knowledge Bases

07/19/2012
by   Simon Lacoste-Julien, et al.
0

The Internet has enabled the creation of a growing number of large-scale knowledge bases in a variety of domains containing complementary information. Tools for automatically aligning these knowledge bases would make it possible to unify many sources of structured knowledge and answer complex queries. However, the efficient alignment of large-scale knowledge bases still poses a considerable challenge. Here, we present Simple Greedy Matching (SiGMa), a simple algorithm for aligning knowledge bases with millions of entities and facts. SiGMa is an iterative propagation algorithm which leverages both the structural information from the relationship graph as well as flexible similarity measures between entity properties in a greedy local search, thus making it scalable. Despite its greedy nature, our experiments indicate that SiGMa can efficiently match some of the world's largest knowledge bases with high precision. We provide additional experiments on benchmark datasets which demonstrate that SiGMa can outperform state-of-the-art approaches both in accuracy and efficiency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/30/2021

Knowledge Base Completion Meets Transfer Learning

The aim of knowledge base completion is to predict unseen facts from exi...
research
12/19/2017

Large-Scale Vandalism Detection with Linear Classifiers - The Conkerberry Vandalism Detector at WSDM Cup 2017

Nowadays many artificial intelligence systems rely on knowledge bases fo...
research
12/24/2015

RDF2Rules: Learning Rules from RDF Knowledge Bases by Mining Frequent Predicate Cycles

Recently, several large-scale RDF knowledge bases have been built and ap...
research
08/20/2020

Language Models as Knowledge Bases: On Entity Representations, Storage Capacity, and Paraphrased Queries

Pretrained language models have been suggested as a possible alternative...
research
06/20/2018

Interpreting Embedding Models of Knowledge Bases: A Pedagogical Approach

Knowledge bases are employed in a variety of applications from natural l...
research
07/23/2018

A Cache-based Optimizer for Querying Enhanced Knowledge Bases

With recent emerging technologies such as the Internet of Things (IoT), ...
research
03/06/2020

Uncovering Hidden Semantics of Set Information in Knowledge Bases

Knowledge Bases (KBs) contain a wealth of structured information about e...

Please sign up or login with your details

Forgot password? Click here to reset