Clustering-Enhanced Stochastic Gradient MCMC for Hidden Markov Models with Rare States

10/31/2018
by   Rihui Ou, et al.
0

MCMC algorithms for hidden Markov models, which often rely on the forward-backward sampler, suffer with large sample size due to the temporal dependence inherent in the data. Recently, a number of approaches have been developed for posterior inference which make use of the mixing of the hidden Markov process to approximate the full posterior by using small chunks of the data. However, in the presence of imbalanced data resulting from rare latent states, the proposed minibatch estimates will often exclude rare state data resulting in poor inference of the associated emission parameters and inaccurate prediction or detection of rare events. Here, we propose to use a preliminary clustering to over-sample the rare clusters and reduce variance in gradient estimation within Stochastic Gradient MCMC. We demonstrate very substantial gains in predictive and inferential accuracy on real and synthetic examples.

READ FULL TEXT
research
06/14/2017

Stochastic Gradient MCMC Methods for Hidden Markov Models

Stochastic gradient MCMC (SG-MCMC) algorithms have proven useful in scal...
research
01/29/2019

Stochastic Gradient MCMC for Nonlinear State Space Models

State space models (SSMs) provide a flexible framework for modeling comp...
research
10/28/2022

Preferential Subsampling for Stochastic Gradient Langevin Dynamics

Stochastic gradient MCMC (SGMCMC) offers a scalable alternative to tradi...
research
10/17/2017

Estimate exponential memory decay in Hidden Markov Model and its applications

Inference in hidden Markov model has been challenging in terms of scalab...
research
12/18/2022

Pigeonhole Stochastic Gradient Langevin Dynamics for Large Crossed Mixed Effects Models

Large crossed mixed effects models with imbalanced structures and missin...
research
12/17/2020

DenseHMM: Learning Hidden Markov Models by Learning Dense Representations

We propose DenseHMM - a modification of Hidden Markov Models (HMMs) that...
research
04/01/2022

Bayesian Non-Homogeneous Hidden Markov Model with Variable Selection for Investigating Drivers of Seizure Risk Cycling

A major issue in the clinical management of epilepsy is the unpredictabi...

Please sign up or login with your details

Forgot password? Click here to reset