Compacting Frequent Star Patterns in RDF Graphs

03/11/2020
by   Farah Karim, et al.
0

Knowledge graphs have become a popular formalism for representing entities and their properties using a graph data model, e.g., the Resource Description Framework (RDF). An RDF graph comprises entities of the same type connected to objects or other entities using labeled edges annotated with properties. RDF graphs usually contain entities that share the same objects in a certain group of properties, i.e., they match star patterns composed of these properties and objects. In case the number of these entities or properties in these star patterns is large, the size of the RDF graph and query processing are negatively impacted; we refer these star patterns as frequent star patterns. We address the problem of identifying frequent star patterns in RDF graphs and devise the concept of factorized RDF graphs, which denote compact representations of RDF graphs where the number of frequent star patterns is minimized. We also develop computational methods to identify frequent star patterns and generate a factorized RDF graph, where compact RDF molecules replace frequent star patterns. A compact RDF molecule of a frequent star pattern denotes an RDF subgraph that instantiates the corresponding star pattern. Instead of having all the entities matching the original frequent star pattern, a surrogate entity is added and related to the properties of the frequent star pattern; it is linked to the entities that originally match the frequent star pattern. We evaluate the performance of our factorization techniques on several RDF graph benchmarks and compare with a baseline built on top of gSpan, a state-of-the-art algorithm to detect frequent patterns. The outcomes evidence the efficiency of proposed approach and show that our techniques are able to reduce execution time of the baseline approach in at least three orders of magnitude reducing the RDF graph size by up to 66.56

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/24/2019

Intersection graph of maximal stars

A biclique of a graph G is an induced complete bipartite subgraph of G s...
research
04/27/2022

Discovering Representative Attribute-stars via Minimum Description Length

Graphs are a popular data type found in many domains. Numerous technique...
research
02/21/2020

Star Pattern Fragments: Accessing Knowledge Graphs through Star Patterns

The Semantic Web offers access to a vast Web of interlinked information ...
research
10/11/2022

Transforming RDF-star to Property Graphs: A Preliminary Analysis of Transformation Approaches – extended version

RDF and property graph models have many similarities, such as using basi...
research
06/01/2020

Scalable Top-k Query on Information Networks with Hierarchical Inheritance Relations

Graph query, pattern mining and knowledge discovery become challenging o...
research
05/20/2021

On planetary systems as ordered sequences

A planetary system consists of a host star and one or more planets, arra...
research
10/25/2021

Detecting Wandering Behavior of People with Dementia

Wandering is a problematic behavior in people with dementia that can lea...

Please sign up or login with your details

Forgot password? Click here to reset