Learning Gaussian-Bernoulli RBMs using Difference of Convex Functions Optimization

02/11/2021
by   Vidyadhar Upadhya, et al.
0

The Gaussian-Bernoulli restricted Boltzmann machine (GB-RBM) is a useful generative model that captures meaningful features from the given n-dimensional continuous data. The difficulties associated with learning GB-RBM are reported extensively in earlier studies. They indicate that the training of the GB-RBM using the current standard algorithms, namely, contrastive divergence (CD) and persistent contrastive divergence (PCD), needs a carefully chosen small learning rate to avoid divergence which, in turn, results in slow learning. In this work, we alleviate such difficulties by showing that the negative log-likelihood for a GB-RBM can be expressed as a difference of convex functions if we keep the variance of the conditional distribution of visible units (given hidden unit states) and the biases of the visible units, constant. Using this, we propose a stochastic difference of convex functions (DC) programming (S-DCP) algorithm for learning the GB-RBM. We present extensive empirical studies on several benchmark datasets to validate the performance of this S-DCP algorithm. It is seen that S-DCP is better than the CD and PCD algorithms in terms of speed of learning and the quality of the generative model learnt.

READ FULL TEXT

page 17

page 18

research
09/21/2017

Learning RBM with a DC programming Approach

By exploiting the property that the RBM log-likelihood function is the d...
research
10/25/2018

Efficient Learning of Restricted Boltzmann Machines Using Covariance estimates

Learning of RBMs using standard algorithms such as CD(k) involves gradie...
research
12/20/2013

Stochastic Gradient Estimate Variance in Contrastive Divergence and Persistent Contrastive Divergence

Contrastive Divergence (CD) and Persistent Contrastive Divergence (PCD) ...
research
10/15/2017

Learning Infinite RBMs with Frank-Wolfe

In this work, we propose an infinite restricted Boltzmann machine (RBM),...
research
01/08/2018

Weighted Contrastive Divergence

Learning algorithms for energy based Boltzmann architectures that rely o...
research
07/24/2015

A Neighbourhood-Based Stopping Criterion for Contrastive Divergence Learning

Restricted Boltzmann Machines (RBMs) are general unsupervised learning d...
research
06/12/2019

DCEF: Deep Collaborative Encoder Framework for Unsupervised Clustering

Collaborative representation is a popular feature learning approach, whi...

Please sign up or login with your details

Forgot password? Click here to reset