Training Restricted Boltzmann Machines on Word Observations

02/25/2012
by   George E. Dahl, et al.
0

The restricted Boltzmann machine (RBM) is a flexible tool for modeling complex data, however there have been significant computational difficulties in using RBMs to model high-dimensional multinomial observations. In natural language processing applications, words are naturally modeled by K-ary discrete distributions, where K is determined by the vocabulary size and can easily be in the hundreds of thousands. The conventional approach to training RBMs on word observations is limited because it requires sampling the states of K-way softmax visible units during block Gibbs updates, an operation that takes time linear in K. In this work, we address this issue by employing a more general class of Markov chain Monte Carlo operators on the visible units, yielding updates with computational complexity independent of K. We demonstrate the success of our approach by training RBMs on hundreds of millions of word n-grams using larger vocabularies than previously feasible and using the learned features to improve performance on chunking and sentiment classification tasks, achieving state-of-the-art results on the latter.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2017

From Monte Carlo to Las Vegas: Improving Restricted Boltzmann Machine Training Through Stopping Sets

We propose a Las Vegas transformation of Markov Chain Monte Carlo (MCMC)...
research
10/14/2019

Parallelized Training of Restricted Boltzmann Machines using Markov-Chain Monte Carlo Methods

Restricted Boltzmann Machine (RBM) is a generative stochastic neural net...
research
10/10/2016

Accelerate Monte Carlo Simulations with Restricted Boltzmann Machines

Despite their exceptional flexibility and popularity, the Monte Carlo me...
research
05/23/2019

Generative training of quantum Boltzmann machines with hidden units

In this article we provide a method for fully quantum generative trainin...
research
09/21/2016

Matrix Variate RBM Model with Gaussian Distributions

Restricted Boltzmann Machine (RBM) is a particular type of random neural...
research
09/02/2022

Three Learning Stages and Accuracy-Efficiency Tradeoff of Restricted Boltzmann Machines

Restricted Boltzmann Machines (RBMs) offer a versatile architecture for ...

Please sign up or login with your details

Forgot password? Click here to reset