Population-Contrastive-Divergence: Does Consistency help with RBM training?

10/06/2015
by Oswin Krause, et al.

Estimating the log-likelihood gradient with respect to the parameters of a Restricted Boltzmann Machine (RBM) typically requires sampling with Markov Chain Monte Carlo (MCMC) techniques. To save computation time, the Markov chains are run for only a small number of steps, which leads to a biased estimate. This bias can cause RBM training algorithms such as Contrastive Divergence (CD) learning to deteriorate. We adopt the idea behind Population Monte Carlo (PMC) methods to devise a new RBM training algorithm termed Population-Contrastive-Divergence (pop-CD). Unlike CD, pop-CD yields a consistent estimate and may have a significantly lower bias, while its computational overhead relative to CD is negligible. However, the variance of the gradient estimate increases. We show experimentally that pop-CD can significantly outperform CD: in many cases, we observed a smaller bias and achieved higher log-likelihood values. However, when the RBM has many hidden neurons, the consistent estimate of pop-CD may still have a considerable bias, and the variance of the gradient estimate requires a smaller learning rate. Thus, despite its superior theoretical properties, pop-CD in its current form is not advisable for large problems.
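To make the contrast concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes a binary RBM with weights `W`, visible bias `b`, and hidden bias `c`, runs a single Gibbs step from the data, and compares the plain CD-1 negative phase (uniform sample weights) with a population-style variant that reweights each sample by self-normalized importance weights (unnormalized target probability divided by the proposal probability of that sample). The exact weighting used in the paper is not stated in the abstract, so this weighting, along with every function and variable name here, is an illustrative assumption.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def free_energy(v, W, b, c):
    # F(v) = -b.v - sum_j log(1 + exp(c_j + (vW)_j)); p(v) is proportional to exp(-F(v)).
    return -v @ b - np.logaddexp(0.0, v @ W + c).sum(axis=1)

def gibbs_step(v, W, b, c, rng):
    # Sample h ~ p(h | v), then v_new ~ p(v | h); also return log p(v_new | h).
    p_h = sigmoid(v @ W + c)
    h = (rng.random(p_h.shape) < p_h).astype(float)
    a = h @ W.T + b                      # visible pre-activations
    v_new = (rng.random(a.shape) < sigmoid(a)).astype(float)
    # Numerically stable Bernoulli log-probability of v_new under the proposal.
    log_q = -(v_new * np.logaddexp(0.0, -a)
              + (1.0 - v_new) * np.logaddexp(0.0, a)).sum(axis=1)
    return v_new, log_q

def grad_W(v_data, W, b, c, rng, reweight):
    # Gradient estimate for W: data term minus (possibly reweighted) model term.
    n = v_data.shape[0]
    v_model, log_q = gibbs_step(v_data, W, b, c, rng)
    if reweight:
        # Assumed population-style weights: unnormalized target / proposal,
        # normalized over the population (self-normalized importance sampling).
        log_w = -free_energy(v_model, W, b, c) - log_q
        w = np.exp(log_w - log_w.max())
        w /= w.sum()
    else:
        # Plain CD-1: every chain sample counts equally.
        w = np.full(n, 1.0 / n)
    pos = v_data.T @ sigmoid(v_data @ W + c) / n
    neg = (v_model * w[:, None]).T @ sigmoid(v_model @ W + c)
    return pos - neg, w

# Toy usage with hypothetical sizes: 20 samples, 6 visible, 4 hidden units.
rng = np.random.default_rng(0)
W = 0.1 * rng.standard_normal((6, 4))
b = np.zeros(6)
c = np.zeros(4)
v_data = (rng.random((20, 6)) < 0.5).astype(float)

g_cd, w_cd = grad_W(v_data, W, b, c, rng, reweight=False)
g_pop, w_pop = grad_W(v_data, W, b, c, rng, reweight=True)
```

In this sketch the only extra work in the reweighted variant is one free-energy evaluation per sample, which is in line with the claim of negligible overhead. When the weights `w_pop` concentrate on a few samples, the effective sample size shrinks, which is one way to picture the increased variance mentioned above.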


