An Analytical Study of Large SPARQL Query Logs

08/01/2017
by Angela Bonifati, et al.

With the adoption of RDF as the data model for Linked Data and the Semantic Web, query specification by end-users has become increasingly common in SPARQL endpoints. In this paper, we conduct an in-depth analytical study of the queries formulated by end-users and harvested from large and up-to-date query logs from a wide variety of RDF data sources. As opposed to previous studies, ours is the first assessment on a voluminous query corpus, spanning several years and covering many representative SPARQL endpoints. Apart from the syntactic structure of the queries, which already exhibits interesting results on this generalized corpus, we drill deeper into the structural characteristics related to the graph and hypergraph representations of queries. We outline the most common shapes of queries when visually displayed as pseudographs, and characterize their (hyper-)tree width. Moreover, we analyze the evolution of queries over time by introducing the novel concept of a streak, i.e., a sequence of queries that appear as subsequent modifications of a seed query. Our study offers several fresh insights on the already rich query features of real SPARQL queries formulated by real users, and leads us to draw a number of conclusions and pinpoint future directions for SPARQL query evaluation, query optimization, tuning, and benchmarking.
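
To make the notion of a streak more concrete, the following is a minimal Python sketch of how consecutive log entries might be grouped into streaks. It is an illustration only, not the authors' procedure: the abstract does not say how "subsequent modification" is measured, so the string-similarity test (`difflib.SequenceMatcher`), the `threshold` of 0.75, the `window` of 2 log positions, and the example queries are all assumptions made for this sketch.

```python
from difflib import SequenceMatcher


def similar(a: str, b: str, threshold: float) -> bool:
    # Assumed similarity test: character-level ratio from the standard library.
    return SequenceMatcher(None, a, b).ratio() >= threshold


def find_streaks(queries, threshold=0.75, window=2):
    """Group a time-ordered list of query strings into streaks.

    A query joins the most recently started streak whose last member is at most
    `window` log positions back and is similar enough to it; otherwise
    it becomes the seed of a new streak. Both parameters are hypothetical.
    """
    streaks = []  # each streak is a list of (log position, query text)
    for i, query in enumerate(queries):
        for streak in reversed(streaks):
            last_pos, last_query = streak[-1]
            if i - last_pos <= window and similar(last_query, query, threshold):
                streak.append((i, query))
                break
        else:
            streaks.append([(i, query)])  # no match: this query is a new seed
    return [[q for _, q in streak] for streak in streaks]


# Hypothetical log: a seed query, two refinements, and one unrelated query.
query_log = [
    "SELECT ?s WHERE { ?s a <http://example.org/Person> }",
    "SELECT ?s WHERE { ?s a <http://example.org/Person> } LIMIT 10",
    "ASK { <http://example.org/x> ?p ?o }",
    "SELECT ?s ?n WHERE { ?s a <http://example.org/Person> } LIMIT 10",
]
streaks = find_streaks(query_log)
```

In this sketch the first, second, and fourth queries end up in one streak, while the unrelated `ASK` query forms its own. Allowing a small window lets a streak survive interleaved queries, which may come from other users of the same endpoint.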

research
12/13/2018

How Many and What Types of SPARQL Queries can be Answered through Zero-Knowledge Link Traversal?

The current de-facto way to query the Web of Data is through the SPARQL ...
research
10/02/2015

Automatic Taxonomy Extraction from Query Logs with no Additional Sources of Information

Search engine logs store detailed information on Web users interactions....
research
03/25/2011

An Empirical Study of Real-World SPARQL Queries

Understanding how users tailor their SPARQL queries is crucial when desi...
research
04/01/2011

Towards an automated query modification assistant

Users who need several queries before finding what they need can benefit...
research
04/04/2020

Towards Query Logs for Privacy Studies: On Deriving Search Queries from Questions

Translating verbose information needs into crisp search queries is a phe...
research
05/01/2020

Studying Ransomware Attacks Using Web Search Logs

Cyber attacks are increasingly becoming prevalent and causing significan...
research
06/20/2019

Extracting Basic Graph Patterns from Triple Pattern Fragment Logs

The Triple Pattern Fragment (TPF) approach is de-facto a new way to publ...
