Bayesian Synthesis of Probabilistic Programs for Automatic Data Modeling

07/14/2019
by Feras A. Saad, et al.

We present new techniques for automatically constructing probabilistic programs for data analysis, interpretation, and prediction. These techniques work with probabilistic domain-specific data modeling languages that capture key properties of a broad class of data generating processes, using Bayesian inference to synthesize probabilistic programs in these modeling languages given observed data. We provide a precise formulation of Bayesian synthesis for automatic data modeling that identifies sufficient conditions for the resulting synthesis procedure to be sound. We also derive a general class of synthesis algorithms for domain-specific languages specified by probabilistic context-free grammars and establish the soundness of our approach for these languages. We apply the techniques to automatically synthesize probabilistic programs for time series data and multivariate tabular data, and we use the synthesized programs in two ways. First, we show how to analyze the structure of the synthesized programs to compute, for key qualitative properties of interest, the probability that the underlying data generating process exhibits each of these properties. Second, we translate probabilistic programs in the domain-specific language into probabilistic programs in Venture, a general-purpose probabilistic programming system. The translated Venture programs are then executed to obtain predictions of new time series data and new multivariate data records. Experimental results show that our techniques can accurately infer qualitative structure in multiple real-world data sets and outperform standard data analysis methods in forecasting and predicting new data.
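
To make the idea concrete, the following is a minimal sketch of Bayesian synthesis over a toy grammar, not the authors' system. Everything in it is an assumption made for illustration: the three-rule grammar, the rule probabilities, the Gaussian noise model, and the names `sample_program`, `evaluate`, `log_likelihood`, `has_periodic`, and `posterior_probability`. For simplicity it uses likelihood weighting (sampling programs from the grammar prior and weighting them by how well they explain the data) as a stand-in for the inference procedure. It then estimates the posterior probability of one qualitative property, namely whether the program contains a periodic component.

```python
import math
import random

PERIODS = [4, 7, 12]  # hypothetical candidate periods for the toy language


def sample_program(rng, depth=0):
    """Draw a program (nested tuple) from a toy PCFG prior:
    E -> linear (0.4) | periodic (0.4) | E + E (0.2).
    Recursion is capped at depth 3, which slightly alters the pure PCFG prior."""
    kinds = ["linear", "periodic", "sum"]
    weights = [0.4, 0.4, 0.2 if depth < 3 else 0.0]
    kind = rng.choices(kinds, weights=weights)[0]
    if kind == "linear":
        return ("linear", rng.gauss(0, 1))
    if kind == "periodic":
        return ("periodic", rng.gauss(0, 1), rng.choice(PERIODS))
    return ("sum", sample_program(rng, depth + 1), sample_program(rng, depth + 1))


def evaluate(prog, t):
    """Mean of the time series at time t under the program."""
    if prog[0] == "linear":
        return prog[1] * t
    if prog[0] == "periodic":
        return prog[1] * math.sin(2 * math.pi * t / prog[2])
    return evaluate(prog[1], t) + evaluate(prog[2], t)


def log_likelihood(prog, data, noise=0.5):
    """Gaussian log-likelihood of the observations given the program."""
    total = 0.0
    for t, y in enumerate(data):
        resid = y - evaluate(prog, t)
        total += -0.5 * (resid / noise) ** 2 - math.log(noise * math.sqrt(2 * math.pi))
    return total


def has_periodic(prog):
    """Qualitative property: does the program contain a periodic term?"""
    if prog[0] == "periodic":
        return True
    if prog[0] == "sum":
        return has_periodic(prog[1]) or has_periodic(prog[2])
    return False


def posterior_probability(data, has_property, n_samples=5000, seed=0):
    """Estimate P(property | data) by likelihood weighting of prior samples."""
    rng = random.Random(seed)
    programs = [sample_program(rng) for _ in range(n_samples)]
    log_w = [log_likelihood(p, data) for p in programs]
    top = max(log_w)  # subtract the max for numerical stability
    weights = [math.exp(lw - top) for lw in log_w]
    hit = sum(w for w, p in zip(weights, programs) if has_property(p))
    return hit / sum(weights)


if __name__ == "__main__":
    observed = [math.sin(2 * math.pi * t / 7) for t in range(21)]
    print(posterior_probability(observed, has_periodic))
```

Sampling from the prior scales poorly as the grammar grows, so this only illustrates the quantities involved: a prior over programs defined by a grammar, a likelihood of the data given a program, and a posterior probability of a structural property computed from weighted programs.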
