Learning Hidden Markov Models Using Conditional Samples

02/28/2023
by   Sham M. Kakade, et al.
0

This paper is concerned with the computational complexity of learning the Hidden Markov Model (HMM). Although HMMs are some of the most widely used tools in sequential and time series modeling, they are cryptographically hard to learn in the standard setting where one has access to i.i.d. samples of observation sequences. In this paper, we depart from this setup and consider an interactive access model, in which the algorithm can query for samples from the conditional distributions of the HMMs. We show that interactive access to the HMM enables computationally efficient learning algorithms, thereby bypassing cryptographic hardness. Specifically, we obtain efficient algorithms for learning HMMs in two settings: (a) An easier setting where we have query access to the exact conditional probabilities. Here our algorithm runs in polynomial time and makes polynomially many queries to approximate any HMM in total variation distance. (b) A harder setting where we can only obtain samples from the conditional distributions. Here the performance of the algorithm depends on a new parameter, called the fidelity of the HMM. We show that this captures cryptographically hard instances and previously known positive results. We also show that these results extend to a broader class of distributions with latent low rank structure. Our algorithms can be viewed as generalizations and robustifications of Angluin's L^* algorithm for learning deterministic finite automata from membership queries.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/17/2018

On Computing the Total Variation Distance of Hidden Markov Models

We prove results on the decidability and complexity of computing the tot...
research
06/26/2022

Marginal Inference queries in Hidden Markov Models under context-free grammar constraints

The primary use of any probabilistic model involving a set of random var...
research
06/08/2016

Clustering with Same-Cluster Queries

We propose a framework for Semi-Supervised Active Clustering framework (...
research
05/29/2019

Learning Bayesian Networks with Low Rank Conditional Probability Tables

In this paper, we provide a method to learn the directed structure of a ...
research
07/11/2012

Conditional Chow-Liu Tree Structures for Modeling Discrete-Valued Vector Time Series

We consider the problem of modeling discrete-valued vector time series d...
research
10/06/2009

Reduced-Rank Hidden Markov Models

We introduce the Reduced-Rank Hidden Markov Model (RR-HMM), a generaliza...
research
08/16/2020

Conjunctive Queries: Unique Characterizations and Exact Learnability

We answer the question which conjunctive queries are uniquely characteri...

Please sign up or login with your details

Forgot password? Click here to reset