Scalable Probabilistic Databases with Factor Graphs and MCMC

05/11/2010
by   Michael Wick, et al.
0

Probabilistic databases play a crucial role in the management and understanding of uncertain data. However, incorporating probabilities into the semantics of incomplete databases has posed many challenges, forcing systems to sacrifice modeling power, scalability, or restrict the class of relational algebra formula under which they are closed. We propose an alternative approach where the underlying relational database always represents a single world, and an external factor graph encodes a distribution over possible worlds; Markov chain Monte Carlo (MCMC) inference is then used to recover this uncertainty to a desired level of fidelity. Our approach allows the efficient evaluation of arbitrary queries over probabilistic databases with arbitrary dependencies expressed by graphical models with structure that changes during inference. MCMC sampling provides efficiency by hypothesizing modifications to possible worlds rather than generating entire worlds from scratch. Queries are then run over the portions of the world that change, avoiding the onerous cost of running full queries over each sampled world. A significant innovation of this work is the connection between MCMC sampling and materialized view maintenance techniques: we find empirically that using view maintenance techniques is several orders of magnitude faster than naively querying each sampled world. We also demonstrate our system's ability to answer relational queries with aggregation, and demonstrate additional scalability through the use of parallelization.

READ FULL TEXT
research
02/24/2020

Symbolic Querying of Vector Spaces: Probabilistic Databases Meets Relational Embeddings

To deal with increasing amounts of uncertainty and incompleteness in rel...
research
06/19/2018

Reducing Property Graph Queries to Relational Algebra for Incremental View Maintenance

The property graph data model of modern graph database systems is increa...
research
12/01/2014

Lifted Probabilistic Inference for Asymmetric Graphical Models

Lifted probabilistic inference algorithms have been successfully applied...
research
07/05/2018

Incremental Relational Lenses

Lenses are a popular approach to bidirectional transformations, a genera...
research
03/24/2014

Adaptive MCMC-Based Inference in Probabilistic Logic Programs

Probabilistic Logic Programming (PLP) languages enable programmers to sp...
research
02/13/2023

Incremental Consistent Updating of Incomplete Databases

Efficient consistency maintenance of incomplete and dynamic real-life da...
research
03/12/2019

Generating and Sampling Orbits for Lifted Probabilistic Inference

Lifted inference scales to large probability models by exploiting symmet...

Please sign up or login with your details

Forgot password? Click here to reset