Gene Expression Time Course Clustering with Countably Infinite Hidden Markov Models

06/27/2012
by   Matthew Beal, et al.
0

Most existing approaches to clustering gene expression time course data treat the different time points as independent dimensions and are invariant to permutations, such as reversal, of the experimental time course. Approaches utilizing HMMs have been shown to be helpful in this regard, but are hampered by having to choose model architectures with appropriate complexities. Here we propose for a clustering application an HMM with a countably infinite state space; inference in this model is possible by recasting it in the hierarchical Dirichlet process (HDP) framework (Teh et al. 2006), and hence we call it the HDP-HMM. We show that the infinite model outperforms model selection methods over finite models, and traditional time-independent methods, as measured by a variety of external and internal indices for clustering on two large publicly available data sets. Moreover, we show that the infinite models utilize more hidden states and employ richer architectures (e.g. state-to-state transitions) without the damaging effects of overfitting.

READ FULL TEXT
research
05/03/2015

A Linear-Time Particle Gibbs Sampler for Infinite Hidden Markov Models

Infinite Hidden Markov Models (iHMM's) are an attractive, nonparametric ...
research
07/21/2017

An Infinite Hidden Markov Model With Similarity-Biased Transitions

We describe a generalization of the Hierarchical Dirichlet Process Hidde...
research
08/18/2018

Bayesian Hidden Markov Tree Models for Clustering Genes with Shared Evolutionary History

Determination of functions for poorly characterized genes is crucial for...
research
11/29/2010

Nonparametric Bayesian sparse factor models with application to gene expression modeling

A nonparametric Bayesian extension of Factor Analysis (FA) is proposed w...
research
02/20/2016

The Segmented iHMM: A Simple, Efficient Hierarchical Infinite HMM

We propose the segmented iHMM (siHMM), a hierarchical infinite hidden Ma...
research
09/12/2018

High-dimensional Bayesian Fourier Analysis For Detecting Circadian Gene Expressions

In genomic applications, there is often interest in identifying genes wh...
research
06/06/2018

On Bayesian inferential tasks with infinite-state jump processes: efficient data augmentation

Advances in sampling schemes for Markov jump processes have recently ena...

Please sign up or login with your details

Forgot password? Click here to reset