Maximum Multiscale Entropy and Neural Network Regularization

06/25/2020
by   Amir R. Asadi, et al.
0

A well-known result across information theory, machine learning, and statistical physics shows that the maximum entropy distribution under a mean constraint has an exponential form called the Gibbs-Boltzmann distribution. This is used for instance in density estimation or to achieve excess risk bounds derived from single-scale entropy regularizers (Xu-Raginsky '17). This paper investigates a generalization of these results to a multiscale setting. We present different ways of generalizing the maximum entropy result by incorporating the notion of scale. For different entropies and arbitrary scale transformations, it is shown that the distribution maximizing a multiscale entropy is characterized by a procedure which has an analogy to the renormalization group procedure in statistical physics. For the case of decimation transformation, it is further shown that this distribution is Gaussian whenever the optimal single-scale distribution is Gaussian. This is then applied to neural networks, and it is shown that in a teacher-student scenario, the multiscale Gibbs posterior can achieve a smaller excess risk than the single-scale Gibbs posterior.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/26/2019

Chaining Meets Chain Rule: Multilevel Entropic Regularization and Training of Neural Nets

We derive generalization and excess risk bounds for neural nets using a ...
research
06/06/2014

Multiscale probability transformation of basic probability assignment

Decision making is still an open issue in the application of Dempster-Sh...
research
12/30/2022

An Entropy-Based Model for Hierarchical Learning

Machine learning is the dominant approach to artificial intelligence, th...
research
04/07/2010

On Tsallis Entropy Bias and Generalized Maximum Entropy Models

In density estimation task, maximum entropy model (Maxent) can effective...
research
12/29/2011

Multi-q Analysis of Image Patterns

This paper studies the use of the Tsallis Entropy versus the classic Bol...
research
01/12/2017

Maximum Entropy Flow Networks

Maximum entropy modeling is a flexible and popular framework for formula...
research
10/07/2021

Physics-inspired analysis of the two-class income distribution in the USA in 1983-2018

The first part of this paper is a brief survey of the approaches to econ...

Please sign up or login with your details

Forgot password? Click here to reset