Efficient SPARQL Autocompletion via SPARQL

04/29/2021
by   Hannah Bast, et al.
0

We show how to achieve fast autocompletion for SPARQL queries on very large knowledge bases. At any position in the body of a SPARQL query, the autocompletion suggests matching subjects, predicates, or objects. The suggestions are context-sensitive in the sense that they lead to a non-empty result and are ranked by their relevance to the part of the query already typed. The suggestions can be narrowed down by prefix search on the names and aliases of the desired subject, predicate, or object. All suggestions are themselves obtained via SPARQL queries, which we call autocompletion queries. For existing SPARQL engines, these queries are impractically slow on large knowledge bases. We present various algorithmic and engineering improvements of an existing SPARQL engine such that these autocompletion queries are executed efficiently. We provide an extensive evaluation of a variety of suggestion methods on three large knowledge bases, including Wikidata (6.9B triples). We explore the trade-off between the relevance of the suggestions and the processing time of the autocompletion queries. We compare our results with two widely used SPARQL engines, Virtuoso and Blazegraph. On Wikidata, we achieve fully sensitive suggestions with sub-second response times for over 90 large and diverse set of thousands of autocompletion queries. Materials for full reproducibility, an interactive evaluation web app, and a demo are available on: https://ad.informatik.uni-freiburg.de/publications .

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/25/2021

Developing a Meta-suggestion Engine for Search Query

With the development of the Internet and the accumulation of information...
research
11/20/2017

Spec-QP: Speculative Query Planning for Joins over Knowledge Graphs

Organisations store huge amounts of data from multiple heterogeneous sou...
research
09/22/2017

Mining User Queries with Information Extraction Methods and Linked Data

Purpose: Advanced usage of Web Analytics tools allows to capture the con...
research
12/01/2018

Approximating Categorical Similarity in Sponsored Search Relevance

Sponsored Search is a major source of revenue for web search engines. Si...
research
08/25/2022

Fundamentals of Task-Agnostic Data Valuation

We study valuing the data of a data owner/seller for a data seeker/buyer...
research
08/08/2021

Fairest Neighbors: Tradeoffs Between Metric Queries

Metric search commonly involves finding objects similar to a given sampl...
research
03/25/2011

An Empirical Study of Real-World SPARQL Queries

Understanding how users tailor their SPARQL queries is crucial when desi...

Please sign up or login with your details

Forgot password? Click here to reset