Dragoman: Efficiently Evaluating Declarative Mapping Languages over Frameworks for Knowledge Graph Creation

10/26/2022
by   Samaneh Jozashoori, et al.
0

In recent years, there have been valuable efforts and contributions to make the process of RDF knowledge graph creation traceable and transparent; extending and applying declarative mapping languages is an example. One challenging step is the traceability of procedures that aim to overcome interoperability issues, a.k.a. data-level integration. In most pipelines, data integration is performed by ad-hoc programs, preventing traceability and reusability. However, formal frameworks provided by function-based declarative mapping languages such as FunUL and RML+FnO empower expressiveness. Data-level integration can be defined as functions and integrated as part of the mappings performing schema-level integration. However, combining functions with the mappings introduces a new source of complexity that can considerably impact the required number of resources and execution time. We tackle the problem of efficiently executing mappings with functions and formalize the transformation of them into function-free mappings. These transformations are the basis of an optimization process that aims to perform an eager evaluation of function-based mapping rules. These techniques are implemented in a framework named Dragoman. We demonstrate the correctness of the transformations while ensuring that the function-free data integration processes are equivalent to the original one. The effectiveness of Dragoman is empirically evaluated in 230 testbeds composed of various types of functions integrated with mapping rules of different complexity. The outcomes suggest that evaluating function-free mapping rules reduces execution time in complex knowledge graph creation pipelines composed of large data sources and multiple types of mapping rules. The savings can be up to 75 making these pipelines applicable and scalable in real-world settings.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2020

FunMap: Efficient Execution of Functional Mappings for Knowledge Graph Creation

Data has exponentially grown in the last years, and knowledge graphs con...
research
12/14/2021

EABlock: A Declarative Entity Alignment Block for Knowledge Graph Creation Pipelines

Despite encoding enormous amount of rich and valuable data, existing dat...
research
11/05/2018

Data Integration for Supporting Biomedical Knowledge Graph Creation at Large-Scale

In recent years, following FAIR and open data principles, the number of ...
research
09/03/2019

MapSDI: A Scaled-up Semantic Data Integration Framework for Knowledge Graph Creation

Semantic web technologies have significantly contributed with effective ...
research
08/17/2020

SDM-RDFizer: An RML Interpreter for the Efficient Creation of RDF Knowledge Graphs

In recent years, the amount of data has increased exponentially, and kno...
research
01/24/2022

Scaling Up Knowledge Graph Creation to Large and Heterogeneous Data Sources

RDF knowledge graphs (KG) are powerful data structures to represent fact...
research
09/09/2009

Interactive Data Integration through Smart Copy & Paste

In many scenarios, such as emergency response or ad hoc collaboration, i...

Please sign up or login with your details

Forgot password? Click here to reset