Fast and Functional Structured Data Generators Rooted in Out-of-Equilibrium Physics

07/13/2023
by Alessandra Carbone, et al.

In this study, we address the challenge of using energy-based models to produce high-quality, label-specific data in complex structured datasets, such as population genetics, RNA, or protein sequence data. Traditional training methods struggle because Markov chain Monte Carlo mixing is inefficient, which reduces the diversity of the synthetic data and increases generation times. To address these issues, we use a novel training algorithm that exploits non-equilibrium effects. Applied to the Restricted Boltzmann Machine, this approach improves the model's ability to correctly classify samples and to generate high-quality synthetic data in only a few sampling steps. We demonstrate the effectiveness of the method on four different types of data: handwritten digits, mutations of human genomes classified by continental origin, functionally characterized sequences of an enzyme protein family, and homologous RNA sequences from specific taxonomies.
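
To make "generation in only a few sampling steps" concrete, the following is a minimal sketch of short-run block Gibbs sampling in a binary Restricted Boltzmann Machine. It is not the authors' implementation: the function name `short_run_gibbs`, the random initialization of the chains, the binary units, and the toy parameters `W`, `a`, `b` are all assumptions made for illustration, and label conditioning is omitted.

```python
import numpy as np

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def short_run_gibbs(W, a, b, n_chains, n_steps, rng):
    """Run n_steps of block Gibbs sampling from random initial states.

    W: (n_visible, n_hidden) weights, a: visible biases, b: hidden biases.
    All names are hypothetical; this is an illustrative binary RBM only.
    """
    n_visible = W.shape[0]
    # Assumption: chains start from uniformly random binary configurations.
    v = (rng.random((n_chains, n_visible)) < 0.5).astype(np.float64)
    for _ in range(n_steps):
        # Sample hidden units given visible units.
        p_h = sigmoid(v @ W + b)
        h = (rng.random(p_h.shape) < p_h).astype(np.float64)
        # Sample visible units given hidden units.
        p_v = sigmoid(h @ W.T + a)
        v = (rng.random(p_v.shape) < p_v).astype(np.float64)
    return v

# Toy usage with untrained, randomly drawn parameters.
rng = np.random.default_rng(0)
W = rng.normal(scale=0.1, size=(6, 3))
a = np.zeros(6)
b = np.zeros(3)
samples = short_run_gibbs(W, a, b, n_chains=4, n_steps=10, rng=rng)
```

In this sketch, `n_steps` is the sampling budget. The abstract's claim is that a model trained with the non-equilibrium algorithm can produce high-quality samples even when that budget is small, whereas a conventionally trained model may need many more steps for the chains to mix.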


