Probabilistic Relational Model Benchmark Generation

03/02/2016
by Mouna Ben Ishak, et al.

Validating any database mining methodology requires an evaluation process for which the availability of benchmarks is essential. In this paper, we aim to randomly generate relational database benchmarks that make it possible to test probabilistic dependencies among attributes. We are particularly interested in Probabilistic Relational Models (PRMs), which extend Bayesian Networks (BNs) to a relational data mining context and enable effective and robust reasoning over relational data. Although many works have focused, separately, on generating random Bayesian networks and on generating relational databases, we have identified no comparable work for PRMs. This paper fills that gap with an algorithmic approach for generating random PRMs from scratch. The proposed method generates PRMs, as well as synthetic relational data, from a randomly generated relational schema and a random set of probabilistic dependencies. This can be of interest not only to machine learning researchers, who can evaluate their proposals in a common framework, but also to database designers, who can evaluate the effectiveness of the components of a database management system.
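To make the three ingredients named in the abstract concrete (a random relational schema, a random set of probabilistic dependencies, and synthetic data drawn from them), here is a minimal Python sketch. It is not the authors' algorithm. Every name in it (`random_schema`, `random_dependencies`, `sample_database`, the `fk` column) is hypothetical, and it rests on simplifying assumptions: all attributes are binary, each class holds at most one foreign key, and an attribute may depend only on earlier attributes of its own class or on attributes of the class it references, which keeps the dependency structure acyclic and avoids aggregation.

```python
import itertools
import random


def random_schema(n_classes, max_attrs, rng):
    """Each class gets 1..max_attrs binary attributes; every class except the
    first holds one foreign key to a randomly chosen earlier class."""
    schema = {}
    for i in range(n_classes):
        schema[f"C{i}"] = {
            "attrs": [f"a{j}" for j in range(rng.randint(1, max_attrs))],
            "ref": f"C{rng.randrange(i)}" if i > 0 else None,
        }
    return schema


def random_dependencies(schema, max_parents, rng):
    """Pick parents only among earlier attributes of the same class or among
    attributes of the referenced class, so the dependency graph is acyclic."""
    deps = {}
    for cls, spec in schema.items():
        for j, attr in enumerate(spec["attrs"]):
            candidates = [(cls, a) for a in spec["attrs"][:j]]
            if spec["ref"] is not None:
                candidates += [(spec["ref"], a) for a in schema[spec["ref"]]["attrs"]]
            k = rng.randint(0, min(max_parents, len(candidates)))
            parents = rng.sample(candidates, k)
            # One Bernoulli parameter P(attr = 1 | parents) per parent configuration.
            cpt = {cfg: rng.random() for cfg in itertools.product((0, 1), repeat=k)}
            deps[(cls, attr)] = (parents, cpt)
    return deps


def sample_database(schema, deps, rows_per_class, rng):
    """Fill one table per class; referenced classes are always filled first
    because a class can only reference a class created before it."""
    db = {}
    for cls, spec in schema.items():
        table = []
        for _ in range(rows_per_class):
            row = {}
            if spec["ref"] is not None:
                row["fk"] = rng.randrange(rows_per_class)  # index of the referenced row
            for attr in spec["attrs"]:
                parents, cpt = deps[(cls, attr)]
                cfg = tuple(
                    row[a] if c == cls else db[c][row["fk"]][a] for c, a in parents
                )
                row[attr] = int(rng.random() < cpt[cfg])
            table.append(row)
        db[cls] = table
    return db
```

Calling the three functions in sequence with a seeded `random.Random` instance yields a reproducible toy benchmark: a schema, a dependency structure with conditional probability tables, and a database instance consistent with both.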

research
09/11/2016

Relational Models

We provide a survey on relational models. Relational models describe com...
research
09/26/2013

A Sound and Complete Algorithm for Learning Causal Models from Relational Data

The PC algorithm learns maximally oriented causal Bayesian networks. How...
research
12/05/2012

Compiling Relational Database Schemata into Probabilistic Graphical Models

Instead of requiring a domain expert to specify the probabilistic depend...
research
12/12/2012

Discriminative Probabilistic Models for Relational Data

In many supervised learning tasks, the entities to be labeled are relate...
research
11/30/2022

Generating Realistic Synthetic Relational Data through Graph Variational Autoencoders

Synthetic data generation has recently gained widespread attention as a ...
research
03/06/2013

A Construction of Bayesian Networks from Databases Based on an MDL Principle

This paper addresses learning stochastic rules especially on an inter-at...
