Spectral Top-Down Recovery of Latent Tree Models

02/26/2021
by   Yariv Aizenbud, et al.
0

Modeling the distribution of high dimensional data by a latent tree graphical model is a common approach in multiple scientific domains. A common task is to infer the underlying tree structure given only observations of the terminal nodes. Many algorithms for tree recovery are computationally intensive, which limits their applicability to trees of moderate size. For large trees, a common approach, termed divide-and-conquer, is to recover the tree structure in two steps. First, recover the structure separately for multiple randomly selected subsets of the terminal nodes. Second, merge the resulting subtrees to form a full tree. Here, we develop Spectral Top-Down Recovery (STDR), a divide-and-conquer approach for inference of large latent tree models. Unlike previous methods, STDR's partitioning step is non-random. Instead, it is based on the Fiedler vector of a suitable Laplacian matrix related to the observed nodes. We prove that under certain conditions this partitioning is consistent with the tree structure. This, in turn leads to a significantly simpler merging procedure of the small subtrees. We prove that STDR is statistically consistent, and bound the number of samples required to accurately recover the tree with high probability. Using simulated data from several common tree models in phylogenetics, we demonstrate that STDR has a significant advantage in terms of runtime, with improved or similar accuracy.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/28/2020

Spectral neighbor joining for reconstruction of latent tree models

A key assumption in multiple scientific applications is that the distrib...
research
09/14/2010

Learning Latent Tree Graphical Models

We study the problem of learning a latent tree graphical model where sam...
research
01/22/2021

SGA: A Robust Algorithm for Partial Recovery of Tree-Structured Graphical Models with Noisy Samples

We consider learning Ising tree models when the observations from the no...
research
08/03/2022

OrderedCuts: A new approach for computing Gomory-Hu tree

The Gomory-Hu tree, or a cut tree, is a classic data structure that stor...
research
03/22/2019

The Binary Space Partitioning-Tree Process

The Mondrian process represents an elegant and powerful approach for spa...
research
03/16/2020

Randomly Aggregated Least Squares for Support Recovery

We study the problem of exact support recovery: given an (unknown) vecto...
research
09/20/2016

An Efficient Method of Partitioning High Volumes of Multidimensional Data for Parallel Clustering Algorithms

An optimal data partitioning in parallel & distributed implementation of...

Please sign up or login with your details

Forgot password? Click here to reset