Spectral Learning of Large Structured HMMs for Comparative Epigenomics

06/04/2015
by Chicheng Zhang, et al.

We develop a latent variable model and an efficient spectral algorithm motivated by the recent emergence of very large data sets of chromatin marks from multiple human cell types. A natural model for chromatin data in one cell type is a Hidden Markov Model (HMM); we model the relationship between multiple cell types by connecting their hidden states through a fixed tree of known structure. The main challenge in learning the parameters of such models is that iterative methods such as EM are very slow, while naive spectral methods have time and space complexity exponential in the number of cell types. We exploit properties of the tree structure of the hidden states to provide spectral algorithms that are more computationally efficient for current biological datasets. We provide sample complexity bounds for our algorithm and evaluate it experimentally on biological data from nine human cell types. Finally, we show that beyond our specific model, some of our algorithmic ideas can be applied to other graphical models.
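To make the exponential blow-up concrete, the following is a minimal sketch, not the paper's algorithm. It assumes each cell type has the same number of hidden states and that a naive approach treats the joint configuration across all cell types as a single HMM state. The function names and the choice of 6 states per cell type are hypothetical; only the nine cell types come from the abstract.

```python
def joint_state_count(states_per_cell_type: int, num_cell_types: int) -> int:
    """Number of joint hidden-state configurations when every cell type
    has its own hidden state and all are combined into one product state."""
    return states_per_cell_type ** num_cell_types


def naive_transition_entries(states_per_cell_type: int, num_cell_types: int) -> int:
    """Entries in a dense transition matrix over the joint (product) state space."""
    joint = joint_state_count(states_per_cell_type, num_cell_types)
    return joint * joint


if __name__ == "__main__":
    # Hypothetical: 6 hidden states per cell type; nine cell types as in the abstract.
    m = 6
    for c in (1, 3, 9):
        print(c, joint_state_count(m, c), naive_transition_entries(m, c))
```

Under these assumptions the joint state space grows as a power of the number of cell types, which is why a method that exploits the tree structure instead of the flat product space is attractive.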

research
02/07/2018

Spectral Learning of Binomial HMMs for DNA Methylation Data

We consider learning parameters of Binomial Hidden Markov Models, which ...
research
03/28/2012

Spectral dimensionality reduction for HMMs

Hidden Markov Models (HMMs) can be accurately approximated using co-occu...
research
07/12/2014

A Spectral Algorithm for Inference in Hidden Semi-Markov Models

Hidden semi-Markov models (HSMMs) are latent variable models which allow...
research
09/21/2016

Learning HMMs with Nonparametric Emissions via Spectral Decompositions of Continuous Matrices

Recently, there has been a surge of interest in using spectral methods f...
research
06/27/2019

A Bayesian Phylogenetic Hidden Markov Model for B Cell Receptor Sequence Analysis

The human body is able to generate a diverse set of high affinity antibo...
research
01/25/2019

Finding Archetypal Spaces for Data Using Neural Networks

Archetypal analysis is a type of factor analysis where data is fit by a ...
research
05/24/2018

Geographical Hidden Markov Tree for Flood Extent Mapping (With Proof Appendix)

Flood extent mapping plays a crucial role in disaster management and nat...
