Aggregated Wasserstein Metric and State Registration for Hidden Markov Models

11/12/2017
by   Yukun Chen, et al.

We propose a framework, named Aggregated Wasserstein, for computing a dissimilarity measure, or distance, between two hidden Markov models (HMMs) whose state-conditional distributions are Gaussian. We refer to such HMMs as Gaussian mixture model HMMs (GMM-HMMs). For a GMM-HMM, the marginal distribution at any time position is a Gaussian mixture, a fact we exploit to softly match, or register, the states of two HMMs. The registration of states is inspired by the intrinsic relationship between optimal transport and the Wasserstein metric between distributions. Specifically, the components of the marginal GMMs are matched by solving an optimal transport problem in which the cost between components is the Wasserstein metric for Gaussian distributions. The solution of this optimization problem is a fast approximation to the Wasserstein metric between two GMMs. The new Aggregated Wasserstein distance is a semi-metric and can be computed without generating Monte Carlo samples. It is invariant to relabeling or permutation of states. The distance remains meaningfully defined even for two HMMs estimated from data of different dimensionality, a situation that can arise when variables are missing. It quantifies the dissimilarity of GMM-HMMs by measuring both the difference between the two marginal GMMs and the difference between the two transition matrices. We test the new distance on retrieval, classification, and t-SNE visualization of time series. Experiments on both synthetic and real data demonstrate its advantages in accuracy and efficiency over existing distances based on the Kullback-Leibler divergence.
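The registration step described above can be illustrated with a minimal sketch. It is not the authors' implementation. It assumes the cost between two Gaussian components is the squared 2-Wasserstein distance, which has a known closed form, and that the component weights of each marginal GMM are already available. The function names `gaussian_w2_squared` and `register_components` are hypothetical, and the comparison of transition matrices is not covered here.

```python
import numpy as np
from scipy.linalg import sqrtm
from scipy.optimize import linprog


def gaussian_w2_squared(m1, S1, m2, S2):
    """Closed-form squared 2-Wasserstein distance between two Gaussians."""
    root_S1 = sqrtm(S1)
    # sqrtm can return a complex array with negligible imaginary part.
    cross = np.real(sqrtm(root_S1 @ S2 @ root_S1))
    return float(np.sum((m1 - m2) ** 2) + np.trace(S1 + S2 - 2.0 * cross))


def register_components(w_a, means_a, covs_a, w_b, means_b, covs_b):
    """Softly match GMM components by solving a discrete optimal transport LP.

    Returns the matching (transport plan) and the square root of the
    optimal cost. Taking the square root is an assumption of this sketch.
    """
    m, n = len(w_a), len(w_b)
    cost = np.array([[gaussian_w2_squared(means_a[i], covs_a[i],
                                          means_b[j], covs_b[j])
                      for j in range(n)] for i in range(m)])

    # Row sums of the plan must equal w_a, column sums must equal w_b.
    A_eq = np.zeros((m + n, m * n))
    for i in range(m):
        A_eq[i, i * n:(i + 1) * n] = 1.0
    for j in range(n):
        A_eq[m + j, j::n] = 1.0
    b_eq = np.concatenate([w_a, w_b])

    res = linprog(cost.ravel(), A_eq=A_eq, b_eq=b_eq,
                  bounds=(0, None), method="highs")
    if not res.success:
        raise RuntimeError(res.message)
    plan = res.x.reshape(m, n)
    return plan, float(np.sqrt(max(res.fun, 0.0)))


# Two toy marginal GMMs that differ only in the order of their components.
eye = np.eye(2)
w_a = np.array([0.3, 0.7])
means_a = [np.array([0.0, 0.0]), np.array([3.0, 0.0])]
w_b = np.array([0.7, 0.3])
means_b = [np.array([3.0, 0.0]), np.array([0.0, 0.0])]

plan, dist = register_components(w_a, means_a, [eye, eye],
                                 w_b, means_b, [eye, eye])
print(plan)
print(dist)
```

In this toy case the two mixtures are identical up to a relabeling of components, so the plan pairs each component with its relabeled counterpart and the resulting distance is numerically zero, which illustrates the permutation invariance mentioned in the abstract.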


