DeepAI AI Chat
Log In Sign Up

Efficient Exploration of Interesting Aggregates in RDF Graphs

03/31/2021
by   Yanlei Diao, et al.
Inria
Ecole Polytechnique
0

As large Open Data are increasingly shared as RDF graphs today, there is a growing demand to help users discover the most interesting facets of a graph, which are often hard to grasp without automatic tools. We consider the problem of automatically identifying the k most interesting aggregate queries that can be evaluated on an RDF graph, given an integer k and a user-specified interestingness function. Our problem departs from analytics in relational data warehouses in that (i) in an RDF graph we are not given but we must identify the facts, dimensions, and measures of candidate aggregates; (ii) the classical approach to efficiently evaluating multiple aggregates breaks in the face of multi-valued dimensions in RDF data. In this work, we propose an extensible end-to-end framework that enables the identification and evaluation of interesting aggregates based on a new RDF-compatible one-pass algorithm for efficiently evaluating a lattice of aggregates and a novel early-stop technique (with probabilistic guarantees) that can prune uninteresting aggregates. Experiments using both real and synthetic graphs demonstrate the ability of our framework to find interesting aggregates in a large search space, the efficiency of our algorithms (with up to 2.9x speedup over a similar pipeline based on existing algorithms), and scalability as the data size and complexity grow.

READ FULL TEXT

page 1

page 2

page 3

page 4

06/12/2019

Kaskade: Graph Views for Efficient Graph Analytics

Graphs are an increasingly popular way to model real-world entities and ...
06/17/2017

An Efficient Probabilistic Approach for Graph Similarity Search

Graph similarity search is a common and fundamental operation in graph d...
07/03/2019

A Software Framework and Datasets for the Analysis of Graph Measures on RDF Graphs

As the availability and the inter-connectivity of RDF datasets grow, so ...
07/31/2020

Neural Architecture Search in Graph Neural Networks

Performing analytical tasks over graph data has become increasingly inte...
09/14/2022

PAPyA: Performance Analysis of Large RDF Graphs Processing Made Easy

Prescriptive Performance Analysis (PPA) has shown to be more useful than...
09/27/2015

Approximation and Heuristic Algorithms for Probabilistic Physical Search on General Graphs

We consider an agent seeking to obtain an item, potentially available at...
06/12/2019

Neural Graph Evolution: Towards Efficient Automatic Robot Design

Despite the recent successes in robotic locomotion control, the design o...