Communication-Avoiding Optimization Methods for Massive-Scale Graphical Model Structure Learning

10/30/2017
by   Penporn Koanantakool, et al.
0

Undirected graphical models compactly represent the structure of large, high-dimensional data sets, which are especially important in interpreting complex scientific data. Some data sets may run to multiple terabytes, and current methods are intractable in both memory size and running time. We introduce HP-CONCORD, a highly scalable optimization algorithm to estimate a sparse inverse covariance matrix based on a regularized pseudolikelihood framework. Our parallel proximal gradient method runs across a multi-node cluster and achieves parallel scalability using a novel communication-avoiding linear algebra algorithm. We demonstrate scalability on problems with 1.28 million dimensions (over 800 billion parameters) and show that it can outperform a previous method on a single node and scales to 1K nodes (24K cores). We use HP-CONCORD to estimate the underlying conditional dependency structure of the brain from fMRI data and use the result to automatically identify functional regions. The results show good agreement with a state-of-the-art clustering from the neuroscience literature.

READ FULL TEXT

page 23

page 24

page 25

page 26

page 27

page 28

page 29

page 30

research
09/15/2015

Large-Scale Optimization Algorithms for Sparse Conditional Gaussian Graphical Models

This paper addresses the problem of scalable optimization for L1-regular...
research
02/14/2018

Linear-Time Algorithm for Learning Large-Scale Sparse Graphical Models

The sparse inverse covariance estimation problem is commonly solved usin...
research
10/05/2019

Clustering Gaussian Graphical Models

We derive an efficient method to perform clustering of nodes in Gaussian...
research
05/06/2021

High-dimensional Functional Graphical Model Structure Learning via Neighborhood Selection Approach

Undirected graphical models have been widely used to model the condition...
research
03/19/2014

A Hierarchical Graphical Model for Big Inverse Covariance Estimation with an Application to fMRI

Brain networks has attracted the interests of many neuroscientists. From...
research
08/14/2022

Virgo: Scalable Unsupervised Classification of Cosmological Shock Waves

Cosmological shock waves are essential to understanding the formation of...
research
11/11/2020

Sketch and Scale: Geo-distributed tSNE and UMAP

Running machine learning analytics over geographically distributed datas...

Please sign up or login with your details

Forgot password? Click here to reset