Attention in a family of Boltzmann machines emerging from modern Hopfield networks

12/09/2022
by Toshihiro Ota, et al.

Hopfield networks and Boltzmann machines (BMs) are fundamental energy-based neural network models. Recent studies on modern Hopfield networks have broadened the class of energy functions and led to a unified perspective on general Hopfield networks, including an attention module. In this letter, we consider the BM counterparts of modern Hopfield networks using the associated energy functions, and study their salient properties from a trainability perspective. In particular, the energy function corresponding to the attention module naturally introduces a novel BM, which we refer to as the attentional BM (AttnBM). We verify that AttnBM has a tractable likelihood function and gradient in a special case and is easy to train. Moreover, we reveal hidden connections between AttnBM and some single-layer models, namely the Gaussian–Bernoulli restricted BM and the denoising autoencoder with softmax units. We also investigate BMs introduced by other energy functions and, in particular, observe that the energy function of dense associative memory models gives BMs belonging to Exponential Family Harmoniums.
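
The abstract does not spell out the energy function behind the attention module. As a rough illustration only, the sketch below uses the form that is standard in the modern Hopfield network literature, E(x) = -(1/beta) log sum_i exp(beta * xi_i . x) + (1/2) x . x, whose update rule is a softmax-weighted average of the stored patterns, i.e. one attention step. The function names, the pattern matrix layout, and the choice of beta are assumptions made for this example; the paper's own definition of AttnBM may differ in details such as constant terms and parameterization.

```python
import numpy as np


def log_sum_exp(z, beta):
    """Numerically stable (1/beta) * log(sum(exp(beta * z)))."""
    scaled = beta * z
    m = np.max(scaled)
    return (m + np.log(np.sum(np.exp(scaled - m)))) / beta


def energy(x, patterns, beta=1.0):
    """Assumed modern Hopfield energy, up to additive constants.

    patterns has shape (d, n): each column is one stored pattern.
    """
    return -log_sum_exp(patterns.T @ x, beta) + 0.5 * (x @ x)


def softmax(z):
    e = np.exp(z - np.max(z))
    return e / np.sum(e)


def update(x, patterns, beta=1.0):
    """One retrieval step: attention over stored patterns with query x."""
    weights = softmax(beta * (patterns.T @ x))
    return patterns @ weights


if __name__ == "__main__":
    # Three orthogonal stored patterns in four dimensions (illustrative data).
    patterns = 3.0 * np.eye(4)[:, :3]
    query = patterns[:, 0] + 0.1  # noisy version of the first pattern
    retrieved = update(query, patterns, beta=4.0)
    print("retrieved:", np.round(retrieved, 4))
    print("energy before:", energy(query, patterns, beta=4.0))
    print("energy after: ", energy(retrieved, patterns, beta=4.0))
```

With well-separated patterns and a moderately large beta, a single update moves the query close to the nearest stored pattern, and the energy does not increase along the way. This only illustrates the deterministic Hopfield side; the BM counterpart studied in the letter treats the same kind of energy as defining a probability distribution.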

