The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems

02/16/2023
by   Indaco Biazzo, et al.
0

Generative Autoregressive Neural Networks (ARNN) have recently demonstrated exceptional results in image and language generation tasks, contributing to the growing popularity of generative models in both scientific and commercial applications. This work presents a physical interpretation of the ARNNs by reformulating the Boltzmann distribution of binary pairwise interacting systems into autoregressive form. The resulting ARNN architecture has weights and biases of its first layer corresponding to the Hamiltonian's couplings and external fields, featuring widely used structures like the residual connections and a recurrent architecture with clear physical meanings. However, the exponential growth, with system size, of the number of parameters of the hidden layers makes its direct application unfeasible. Nevertheless, its architecture's explicit formulation allows using statistical physics techniques to derive new ARNNs for specific systems. As examples, new effective ARNN architectures are derived from two well-known mean-field systems, the Curie-Weiss and Sherrington-Kirkpatrick models, showing superior performances in approximating the Boltzmann distributions of the corresponding physics model than other commonly used ARNNs architectures. The connection established between the physics of the system and the ARNN architecture provides a way to derive new neural network architectures for different interacting systems and interpret existing ones from a physical perspective.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/15/2017

Deep Learning the Ising Model Near Criticality

It is well established that neural networks with deep architectures perf...
research
09/27/2018

Solving Statistical Mechanics using Variational Autoregressive Networks

We propose a general framework for solving statistical mechanics of syst...
research
07/11/2023

Monotone deep Boltzmann machines

Deep Boltzmann machines (DBMs), one of the first “deep” learning methods...
research
02/28/2017

Can Boltzmann Machines Discover Cluster Updates ?

Boltzmann machines are physics informed generative models with wide appl...
research
09/05/2023

Inferring effective couplings with Restricted Boltzmann Machines

Generative models offer a direct way to model complex data. Among them, ...
research
01/15/2020

Learning the Ising Model with Generative Neural Networks

Recent advances in deep learning and neural networks have led to an incr...
research
09/14/2022

Optimal Connectivity through Network Gradients for the Restricted Boltzmann Machine

Leveraging sparse networks to connect successive layers in deep neural n...

Please sign up or login with your details

Forgot password? Click here to reset