Wasserstein Training of Boltzmann Machines

07/07/2015
by Grégoire Montavon, et al.

The Boltzmann machine provides a useful framework for learning the highly complex, multimodal and multiscale data distributions that occur in the real world. The default method for learning its parameters consists of minimizing the Kullback-Leibler (KL) divergence from training samples to the Boltzmann model. In this work we propose a novel approach to Boltzmann machine training that assumes a meaningful metric between observations is given. This metric can be represented by the Wasserstein distance between distributions, for which we derive a gradient with respect to the model parameters. Minimizing this new Wasserstein objective leads to generative models that perform better with respect to the metric and that have a cluster-like structure. We demonstrate the practical potential of these models for data completion and denoising, tasks in which the metric between observations plays a crucial role.
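
The contrast between the two objectives can be illustrated with a small numerical sketch. This is not the paper's training procedure: it only compares the KL divergence and the Wasserstein distance on toy one-dimensional histograms. The helper names `kl_divergence` and `wasserstein_1d`, the five grid positions, and the probability values are all assumptions made for illustration, and the ground metric is taken to be the absolute difference between positions.

```python
import numpy as np

def kl_divergence(p, q):
    """KL(p || q) for two discrete distributions on the same support."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0  # terms with p = 0 contribute nothing
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def wasserstein_1d(p, q, positions):
    """1-Wasserstein distance on the real line (illustrative special case).

    In one dimension, with |x - y| as ground metric, the distance equals
    the area between the two cumulative distribution functions.
    """
    cdf_gap = np.abs(np.cumsum(p) - np.cumsum(q))[:-1]
    return float(np.sum(cdf_gap * np.diff(positions)))

# Hypothetical toy data: five states placed at 0, 1, 2, 3, 4.
positions = np.arange(5.0)
data = np.array([0.96, 0.01, 0.01, 0.01, 0.01])  # mass concentrated at 0
near = np.array([0.01, 0.96, 0.01, 0.01, 0.01])  # model mass one step away
far = np.array([0.01, 0.01, 0.01, 0.01, 0.96])   # model mass four steps away

kl_near = kl_divergence(data, near)
kl_far = kl_divergence(data, far)
w_near = wasserstein_1d(data, near, positions)
w_far = wasserstein_1d(data, far, positions)

print(f"KL:          near={kl_near:.4f}  far={kl_far:.4f}")
print(f"Wasserstein: near={w_near:.4f}  far={w_far:.4f}")
```

In this toy setting the KL divergence assigns the same value to both candidate models, because it ignores how far the misplaced mass sits from the data, whereas the Wasserstein distance is smaller for the model whose mass lies closer under the metric.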

