Towards a property graph generator for benchmarking

04/03/2017
by   Arnau Prat-Pérez, et al.
0

The use of synthetic graph generators is a common practice among graph-oriented benchmark designers, as it allows obtaining graphs with the required scale and characteristics. However, finding a graph generator that accurately fits the needs of a given benchmark is very difficult, thus practitioners end up creating ad-hoc ones. Such a task is usually time-consuming, and often leads to reinventing the wheel. In this paper, we introduce the conceptual design of DataSynth, a framework for property graphs generation with customizable schemas and characteristics. The goal of DataSynth is to assist benchmark designers in generating graphs efficiently and at scale, saving from implementing their own generators. Additionally, DataSynth introduces novel features barely explored so far, such as modeling the correlation between properties and the structure of the graph. This is achieved by a novel property-to-node matching algorithm for which we present preliminary promising results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/07/2022

AgraSSt: Approximate Graph Stein Statistics for Interpretable Assessment of Implicit Graph Generators

We propose and analyse a novel statistical procedure, coined AgraSSt, to...
research
07/17/2023

Examining the Effects of Degree Distribution and Homophily in Graph Learning Models

Despite a surge in interest in GNN development, homogeneity in benchmark...
research
08/29/2023

ACER: An AST-based Call Graph Generator Framework

We introduce ACER, an AST-based call graph generator framework. ACER lev...
research
04/04/2022

SPECTRE : Spectral Conditioning Helps to Overcome the Expressivity Limits of One-shot Graph Generators

We approach the graph generation problem from a spectral perspective by ...
research
08/09/2023

Data-driven Intra-Autonomous Systems Graph Generator

This paper introduces a novel deep-learning based generator of synthetic...
research
01/22/2020

Graph Generators: State of the Art and Open Challenges

The abundance of interconnected data has fueled the design and implement...
research
07/03/2019

A Software Framework and Datasets for the Analysis of Graph Measures on RDF Graphs

As the availability and the inter-connectivity of RDF datasets grow, so ...

Please sign up or login with your details

Forgot password? Click here to reset