Distributed Subweb Specifications for Traversing the Web

02/28/2023
by Bart Bogaerts, et al.

Link Traversal-based Query Processing (LTQP), in which a SPARQL query is evaluated over a web of documents rather than a single dataset, is often seen as a theoretically interesting yet impractical technique. However, at a time when the hypercentralization of data has increasingly come under scrutiny, a decentralized Web of Data with a simple document-based interface is appealing, as it enables data publishers to control their data and access rights. While LTQP allows evaluating complex queries over such webs, it suffers from performance issues (due to the high number of documents containing data) as well as information quality concerns (due to the many sources providing such documents). In existing LTQP approaches, the burden of finding sources to query rests entirely with the data consumer. In this paper, we argue that to solve these issues, data publishers should also be able to suggest sources of interest and guide the data consumer towards relevant and trustworthy data. We introduce a theoretical framework that enables such guided link traversal and study its properties. We illustrate with a theoretical example that this can improve query results and reduce the number of network requests. We evaluate our proposal experimentally on a virtual linked web with specifications and indeed observe that not just the data quality but also the efficiency of querying improves. Under consideration in Theory and Practice of Logic Programming (TPLP).
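
To make the contrast between unguided and publisher-guided traversal concrete, the following is a minimal toy sketch in Python. It is not the framework or specification language from the paper: the in-memory `WEB` dictionary, the `ex:` URIs, the `suggested` field, and the `traverse` function are all hypothetical names invented for illustration. The sketch assumes that each document lists the sources its publisher recommends, and it counts one simulated network request per dereferenced URI.

```python
from collections import deque

# Hypothetical in-memory "web" (an assumption of this sketch): each URI maps
# to a document with its triples and the sources its publisher recommends.
WEB = {
    "ex:alice": {
        "triples": [("ex:alice", "knows", "ex:bob"),
                    ("ex:alice", "knows", "ex:spam")],
        "suggested": ["ex:bob"],
    },
    "ex:bob": {
        "triples": [("ex:bob", "name", "Bob")],
        "suggested": [],
    },
    "ex:spam": {
        "triples": [("ex:spam", "name", "Not Bob"),
                    ("ex:spam", "knows", "ex:ads")],
        "suggested": ["ex:ads"],
    },
    "ex:ads": {
        "triples": [("ex:ads", "name", "Ads")],
        "suggested": [],
    },
}


def traverse(seed, guided):
    """Breadth-first link traversal from `seed`.

    Returns (collected_triples, number_of_simulated_requests).
    """
    seen = {seed}
    queue = deque([seed])
    triples = []
    requests = 0
    while queue:
        uri = queue.popleft()
        requests += 1  # one simulated network request per dereferenced URI
        doc = WEB.get(uri)
        if doc is None:
            continue
        triples.extend(doc["triples"])
        if guided:
            # Follow only the sources suggested by the document's publisher.
            links = doc["suggested"]
        else:
            # Follow every URI mentioned in subject or object position.
            links = [term for s, _, o in doc["triples"]
                     for term in (s, o) if term.startswith("ex:")]
        for link in links:
            if link not in seen:
                seen.add(link)
                queue.append(link)
    return triples, requests


all_triples, all_requests = traverse("ex:alice", guided=False)
guided_triples, guided_requests = traverse("ex:alice", guided=True)
print(all_requests, guided_requests)
```

In this toy web, the guided run never dereferences `ex:spam` or `ex:ads`, so it issues fewer requests and collects no triples from those sources. Real LTQP engines evaluate SPARQL patterns during traversal, which this sketch deliberately leaves out.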


research
05/03/2020

Guided Link-Traversal-Based Query Processing

Link-Traversal-Based Query Processing (LTBQP) is a technique for evaluat...
research
12/13/2018

How Many and What Types of SPARQL Queries can be Answered through Zero-Knowledge Link Traversal?

The current de-facto way to query the Web of Data is through the SPARQL ...
research
10/17/2022

Estimating the Cost of Executing Link Traversal based SPARQL Queries

An increasing number of organisations in almost all fields have started ...
research
02/14/2023

Evaluation of Link Traversal Query Execution over Decentralized Environments with Structural Assumptions

To counter societal and economic problems caused by data silos on the We...
research
12/18/2019

Data Services with Bindaas: RESTful Interfaces for Diverse Data Sources

The diversity of data management systems affords developers the luxury o...
research
02/19/2016

Ordonnancement d'entités pour la rencontre du web des documents et du web des données

The advances of the Linked Open Data (LOD) initiative are giving rise to...
research
07/13/2021

Querying Linked Data: how to ensure user's quality requirements

In the distributed and dynamic framework of the Web, data quality is a b...
