Coresets for Dependency Networks

10/09/2017
by   Alejandro Molina, et al.
0

Many applications infer the structure of a probabilistic graphical model from data to elucidate the relationships between variables. But how can we train graphical models on a massive data set? In this paper, we show how to construct coresets -compressed data sets which can be used as proxy for the original data and have provably bounded worst case error- for Gaussian dependency networks (DNs), i.e., cyclic directed graphical models over Gaussians, where the parents of each variable are its Markov blanket. Specifically, we prove that Gaussian DNs admit coresets of size independent of the size of the data set. Unfortunately, this does not extend to DNs over members of the exponential family in general. As we will prove, Poisson DNs do not admit small coresets. Despite this worst-case result, we will provide an argument why our coreset construction for DNs can still work well in practice on count data. To corroborate our theoretical results, we empirically evaluated the resulting Core DNs on real data sets. The results

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/17/2013

On Graphical Models via Univariate Exponential Family Distributions

Undirected graphical models, or Markov networks, are a popular class of ...
research
10/02/2022

Neural Graphical Models

Graphs are ubiquitous and are often used to understand the dynamics of a...
research
01/15/2014

AND/OR Multi-Valued Decision Diagrams (AOMDDs) for Graphical Models

Inspired by the recently introduced framework of AND/OR search spaces fo...
research
02/14/2016

Identifiability Assumptions and Algorithm for Directed Graphical Models with Feedback

Directed graphical models provide a useful framework for modeling causal...
research
09/22/2021

Decentralized Learning of Tree-Structured Gaussian Graphical Models from Noisy Data

This paper studies the decentralized learning of tree-structured Gaussia...
research
05/30/2021

Approximate Implication with d-Separation

The graphical structure of Probabilistic Graphical Models (PGMs) encodes...
research
11/15/2016

Unsupervised Learning with Truncated Gaussian Graphical Models

Gaussian graphical models (GGMs) are widely used for statistical modelin...

Please sign up or login with your details

Forgot password? Click here to reset