Stochastic Gradient Estimate Variance in Contrastive Divergence and Persistent Contrastive Divergence

12/20/2013
by   Mathias Berglund, et al.
0

Contrastive Divergence (CD) and Persistent Contrastive Divergence (PCD) are popular methods for training the weights of Restricted Boltzmann Machines. However, both methods use an approximate method for sampling from the model distribution. As a side effect, these approximations yield significantly different biases and variances for stochastic gradient estimates of individual data points. It is well known that CD yields a biased gradient estimate. In this paper we however show empirically that CD has a lower stochastic gradient estimate variance than exact sampling, while the mean of subsequent PCD estimates has a higher variance than exact sampling. The results give one explanation to the finding that CD can be used with smaller minibatches or higher learning rates than PCD.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/06/2015

Population-Contrastive-Divergence: Does Consistency help with RBM training?

Estimating the log-likelihood gradient with respect to the parameters of...
research
01/08/2018

Weighted Contrastive Divergence

Learning algorithms for energy based Boltzmann architectures that rely o...
research
05/25/2023

Which Features are Learnt by Contrastive Learning? On the Role of Simplicity Bias in Class Collapse and Feature Suppression

Contrastive learning (CL) has emerged as a powerful technique for repres...
research
02/11/2021

Learning Gaussian-Bernoulli RBMs using Difference of Convex Functions Optimization

The Gaussian-Bernoulli restricted Boltzmann machine (GB-RBM) is a useful...
research
03/20/2023

Constructing Bayesian Pseudo-Coresets using Contrastive Divergence

Bayesian Pseudo-Coreset (BPC) and Dataset Condensation are two parallel ...
research
10/19/2022

Gaussian-Bernoulli RBMs Without Tears

We revisit the challenging problem of training Gaussian-Bernoulli restri...
research
01/16/2013

Adaptive Importance Sampling for Estimation in Structured Domains

Sampling is an important tool for estimating large, complex sums and int...

Please sign up or login with your details

Forgot password? Click here to reset