OBDA for the Web: Creating Virtual RDF Graphs On Top of Web Data Sources

05/22/2020
by   Konstantina Bereta, et al.
0

Due to Variety, Web data come in many different structures and formats, with HTML tables and REST APIs (e.g., social media APIs) being among the most popular ones. A big subset of Web data is also characterised by Velocity, as data gets frequently updated so that consumers can obtain the most up-to-date version of the respective datasets. At the moment, though, these data sources are not effectively supported by Semantic Web tools. To address variety and velocity, we propose Ontop4theWeb, a system that maps Web data of various formats into virtual RDF triples, thus allowing for querying them on-the-fly without materializing them as RDF. We demonstrate how Ontop4theWeb can use SPARQL to uniformly query popular, but heterogeneous Web data sources, like HTML tables and Web APIs. We showcase our approach in a number of use cases, such as Twitter, Foursquare, Yelp and HTML tables. We carried out a thorough experimental evaluation which verifies the high efficiency of our framework, which goes beyond the current state-of-the-art in this area, in terms of both functionality and performance.

READ FULL TEXT
research
10/05/2021

Data Validation for Big Live Data

Data Integration of heterogeneous data sources relies either on periodic...
research
12/18/2019

Data Services with Bindaas: RESTful Interfaces for Diverse Data Sources

The diversity of data management systems affords developers the luxury o...
research
03/31/2020

The Case For Alternative Web Archival Formats To Expedite The Data-To-Insight Cycle

The WARC file format is widely used by web archives to preserve collecte...
research
11/25/2017

Privacy Risks from Public Data Sources

In the fight against tax evaders and other cheats, governments seek to g...
research
06/16/2020

Applying Social Event Data for the Management of Cellular Networks

Internet provides a growing variety of social data sources: calendars, e...
research
12/08/2022

GenSyn: A Multi-stage Framework for Generating Synthetic Microdata using Macro Data Sources

Individual-level data (microdata) that characterizes a population, is es...
research
05/20/2019

Ingesting High-Velocity Streaming Graphs from Social Media Sources

Many data science applications like social network analysis use graphs a...

Please sign up or login with your details

Forgot password? Click here to reset