Efficient Evaluation of the Partition Function of RBMs with Annealed Importance Sampling

07/23/2020
by   Ferran Mazzanti, et al.
0

Probabilistic models based on Restricted Boltzmann Machines (RBMs) imply the evaluation of normalized Boltzmann factors, which in turn require from the evaluation of the partition function Z. The exact evaluation of Z, though, becomes a forbiddingly expensive task as the system size increases. This even worsens when one considers most usual learning algorithms for RBMs, where the exact evaluation of the gradient of the log-likelihood of the empirical distribution of the data includes the computation of Z at each iteration. The Annealed Importance Sampling (AIS) method provides a tool to stochastically estimate the partition function of the system. So far, the standard use of the AIS algorithm in the Machine Learning context has been done using a large number of Monte Carlo steps. In this work we show that this may not be required if a proper starting probability distribution is employed as the initialization of the AIS algorithm. We analyze the performance of AIS in both small- and large-sized problems, and show that in both cases a good estimation of Z can be obtained with little computational cost.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/08/2022

Free Energy Evaluation Using Marginalized Annealed Importance Sampling

The evaluation of the free energy of a stochastic model is considered to...
research
03/07/2016

Partition Functions from Rao-Blackwellized Tempered Sampling

Partition functions of probability distributions are important quantitie...
research
06/14/2019

Empirical Bayes Method for Boltzmann Machines

In this study, we consider an empirical Bayes method for Boltzmann machi...
research
03/15/2017

A New Unbiased and Efficient Class of LSH-Based Samplers and Estimators for Partition Function Computation in Log-Linear Models

Log-linear models are arguably the most successful class of graphical mo...
research
01/08/2018

Weighted Contrastive Divergence

Learning algorithms for energy based Boltzmann architectures that rely o...
research
09/10/2019

Inverse Ising inference from high-temperature re-weighting of observations

Maximum Likelihood Estimation (MLE) is the bread and butter of system in...
research
05/09/2012

Products of Hidden Markov Models: It Takes N>1 to Tango

Products of Hidden Markov Models(PoHMMs) are an interesting class of gen...

Please sign up or login with your details

Forgot password? Click here to reset