Batch Stationary Distribution Estimation

03/02/2020
by   Junfeng Wen, et al.
7

We consider the problem of approximating the stationary distribution of an ergodic Markov chain given a set of sampled transitions. Classical simulation-based approaches assume access to the underlying process so that trajectories of sufficient length can be gathered to approximate stationary sampling. Instead, we consider an alternative setting where a fixed set of transitions has been collected beforehand, by a separate, possibly unknown procedure. The goal is still to estimate properties of the stationary distribution, but without additional access to the underlying system. We propose a consistent estimator that is based on recovering a correction ratio function over the given data. In particular, we develop a variational power method (VPM) that provides provably consistent estimates under general conditions. In addition to unifying a number of existing approaches from different subfields, we also find that VPM yields significantly better estimates across a range of problems, including queueing, stochastic differential equations, post-processing MCMC, and off-policy evaluation.

READ FULL TEXT

page 10

page 19

research
02/21/2020

GenDICE: Generalized Offline Estimation of Stationary Values

An important problem that arises in reinforcement learning and Monte Car...
research
01/17/2023

On the Statistical Consistency of a Generalized Cepstral Estimator

We consider the problem to estimate the generalized cepstral coefficient...
research
10/02/2020

MCMC-Interactive Variational Inference

Leveraging well-established MCMC strategies, we propose MCMC-interactive...
research
05/03/2023

Streaming PCA for Markovian Data

Since its inception in Erikki Oja's seminal paper in 1982, Oja's algorit...
research
02/21/2020

Stochastic Normalizing Flows

We introduce stochastic normalizing flows, an extension of continuous no...
research
01/25/2020

The reproducing Stein kernel approach for post-hoc corrected sampling

Stein importance sampling is a widely applicable technique based on kern...
research
10/04/2020

High-dimensional Gaussian sampling: a review and a unifying approach based on a stochastic proximal point algorithm

Efficient sampling from a high-dimensional Gaussian distribution is an o...

Please sign up or login with your details

Forgot password? Click here to reset