Using Inherent Structures to design Lean 2-layer RBMs

06/12/2018
by   Abhishek Bansal, et al.
0

Understanding the representational power of Restricted Boltzmann Machines (RBMs) with multiple layers is an ill-understood problem and is an area of active research. Motivated from the approach of Inherent Structure formalism (Stillinger & Weber, 1982), extensively used in analysing Spin Glasses, we propose a novel measure called Inherent Structure Capacity (ISC), which characterizes the representation capacity of a fixed architecture RBM by the expected number of modes of distributions emanating from the RBM with parameters drawn from a prior distribution. Though ISC is intractable, we show that for a single layer RBM architecture ISC approaches a finite constant as number of hidden units are increased and to further improve the ISC, one needs to add a second layer. Furthermore, we introduce Lean RBMs, which are multi-layer RBMs where each layer can have at-most O(n) units with the number of visible units being n. We show that for every single layer RBM with Ω(n^2+r), r > 0, hidden units there exists a two-layered lean RBM with Θ(n^2) parameters with the same ISC, establishing that 2 layer RBMs can achieve the same representational power as single-layer RBMs but using far fewer number of parameters. To the best of our knowledge, this is the first result which quantitatively establishes the need for layering.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/16/2023

Structural Restricted Boltzmann Machine for image denoising and classification

Restricted Boltzmann Machines are generative models that consist of a la...
research
02/09/2015

An Infinite Restricted Boltzmann Machine

We present a mathematical construction for the restricted Boltzmann mach...
research
02/21/2008

Testing the number of parameters with multidimensional MLP

This work concerns testing the number of parameters in one hidden layer ...
research
12/05/2018

Self-learning Local Supervision Encoding Framework to Constrict and Disperse Feature Distribution for Clustering

To obtain suitable feature distribution is a difficult task in machine l...
research
01/24/2020

Effectively Trainable Semi-Quantum Restricted Boltzmann Machine

We propose a novel quantum model for the restricted Boltzmann machine (R...
research
08/24/2017

On the Compressive Power of Deep Rectifier Networks for High Resolution Representation of Class Boundaries

This paper provides a theoretical justification of the superior classifi...
research
11/06/2015

The Poisson Gamma Belief Network

To infer a multilayer representation of high-dimensional count vectors, ...

Please sign up or login with your details

Forgot password? Click here to reset