Gaussian Graphical Model Selection for Huge Data via Minipatch Learning

10/22/2021
by   Tianyi Yao, et al.
0

Gaussian graphical models are essential unsupervised learning techniques to estimate conditional dependence relationships between sets of nodes. While graphical model selection is a well-studied problem with many popular techniques, there are typically three key practical challenges: i) many existing methods become computationally intractable in huge-data settings with tens of thousands of nodes; ii) the need for separate data-driven tuning hyperparameter selection procedures considerably adds to the computational burden; iii) the statistical accuracy of selected edges often deteriorates as the dimension and/or the complexity of the underlying graph structures increase. We tackle these problems by proposing the Minipatch Graph (MPGraph) estimator. Our approach builds upon insights from the latent variable graphical model problem and utilizes ensembles of thresholded graph estimators fit to tiny, random subsets of both the observations and the nodes, termed minipatches. As estimates are fit on small problems, our approach is computationally fast with integrated stability-based hyperparameter tuning. Additionally, we prove that under certain conditions our MPGraph algorithm achieves finite-sample graph selection consistency. We compare our approach to state-of-the-art computational approaches to Gaussian graphical model selection including the BigQUIC algorithm, and empirically demonstrate that our approach is not only more accurate but also extensively faster for huge graph selection problems.

READ FULL TEXT
research
10/16/2020

Feature Selection for Huge Data via Minipatch Learning

Feature selection often leads to increased model interpretability, faste...
research
11/05/2012

Rejoinder: Latent variable graphical model selection via convex optimization

Rejoinder to "Latent variable graphical model selection via convex optim...
research
11/05/2012

Discussion: Latent variable graphical model selection via convex optimization

Discussion of "Latent variable graphical model selection via convex opti...
research
02/04/2021

Graph Coding for Model Selection and Anomaly Detection in Gaussian Graphical Models

A classic application of description length is for model selection with ...
research
07/27/2022

Graphical Gaussian models associated to a homogeneous graph with permutation symmetries

We consider multivariate centered Gaussian models for the random vector ...
research
04/13/2021

Thresholded Graphical Lasso Adjusts for Latent Variables: Application to Functional Neural Connectivity

In neuroscience, researchers seek to uncover the connectivity of neurons...
research
03/12/2015

Qualitative inequalities for squared partial correlations of a Gaussian random vector

We describe various sets of conditional independence relationships, suff...

Please sign up or login with your details

Forgot password? Click here to reset