Learning pairwise Markov network structures using correlation neighborhoods

10/30/2019
by Juri Kuronen, et al.

Markov networks are widely studied and used throughout multivariate statistics and computer science. In particular, learning the structure of Markov networks from data without invoking chordality assumptions, so as to retain the expressiveness of the model class, has received considerable attention in the recent literature, where numerous constraint-based and score-based methods have been introduced. Here we develop a new search algorithm for network score optimization that has several computational advantages and scales well to high-dimensional data sets. The key observation behind the algorithm is that the neighborhood of a variable can be efficiently captured using local penalized likelihood ratio (PLR) tests by exploiting the exponential decay of correlations across the neighborhood with increasing graph-theoretic distance from the focus node. The candidate neighborhoods are then processed by a two-stage hill-climbing (HC) algorithm. Our approach, termed PLRHC-BIC_0.5 in full, compares favorably against state-of-the-art methods in all our experiments, which span both low- and high-dimensional networks and a wide range of sample sizes. An efficient implementation of PLRHC-BIC_0.5 is freely available at https://github.com/jurikuronen/plrhc.
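The correlation-decay observation can be illustrated with a toy example. The sketch below is not the authors' PLR test or hill-climbing search; it simulates a binary chain-structured network, in which the correlation between two variables at graph distance k is (1 - 2p)^k for flip probability p, and then picks a candidate neighborhood with a plain correlation threshold as a simplified stand-in. All function names, the threshold, and the simulation settings are assumptions made for illustration.

```python
import random


def simulate_chain(n_samples, n_vars, flip_prob, seed=0):
    """Sample a binary chain X0 - X1 - ... with values in {-1, +1}.

    Each variable copies its predecessor and flips sign with
    probability flip_prob (hypothetical toy model, not from the paper).
    """
    rng = random.Random(seed)
    data = []
    for _ in range(n_samples):
        row = [rng.choice((-1, 1))]
        for _ in range(n_vars - 1):
            prev = row[-1]
            row.append(-prev if rng.random() < flip_prob else prev)
        data.append(row)
    return data


def correlation(data, i, j):
    """Pearson correlation between columns i and j."""
    n = len(data)
    mean_i = sum(row[i] for row in data) / n
    mean_j = sum(row[j] for row in data) / n
    cov = sum((row[i] - mean_i) * (row[j] - mean_j) for row in data) / n
    var_i = sum((row[i] - mean_i) ** 2 for row in data) / n
    var_j = sum((row[j] - mean_j) ** 2 for row in data) / n
    return cov / (var_i * var_j) ** 0.5


def candidate_neighborhood(data, focus, threshold):
    """Keep variables whose absolute correlation with `focus` exceeds
    `threshold`. A crude stand-in for the local tests in the abstract."""
    n_vars = len(data[0])
    return [
        j
        for j in range(n_vars)
        if j != focus and abs(correlation(data, focus, j)) > threshold
    ]


data = simulate_chain(n_samples=20000, n_vars=6, flip_prob=0.2)

# Correlation of X0 with the variable at graph distance k = 1..5.
# Theory for this toy chain: (1 - 2 * 0.2) ** k = 0.6 ** k.
decay = [correlation(data, 0, k) for k in range(1, 6)]
for k, rho in enumerate(decay, start=1):
    print(f"distance {k}: estimated {rho:.3f}, theoretical {0.6 ** k:.3f}")

# Distant variables fall below the threshold, so the candidate set stays small.
print(candidate_neighborhood(data, focus=0, threshold=0.3))
```

Because distant variables drop out of the candidate set, any subsequent search only has to consider a small number of potential neighbors per node, which is the kind of saving the abstract attributes to the local tests.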


