A Dual Process Model for Optimizing Cross Entropy in Neural Networks

04/27/2021
by Stefan Jaeger, et al.

Minimizing cross-entropy is a widely used method for training artificial neural networks, and many backpropagation-based training procedures use cross-entropy directly as their loss function. This theoretical essay instead investigates a dual process model in which one process minimizes the Kullback-Leibler divergence while its dual counterpart minimizes the Shannon entropy. Postulating that learning consists of two dual processes that complement each other, the model defines an equilibrium state for both processes in which the loss function assumes its minimum. An advantage of the proposed model is that it allows the optimal learning rate and momentum weight for backpropagation weight updates to be derived. Furthermore, the model introduces the golden ratio and complex numbers as important new concepts in machine learning.
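
The split into a Kullback-Leibler term and a Shannon entropy term rests on the standard identity H(p, q) = H(p) + KL(p || q). The following is a minimal numerical sketch of that identity only, not of the paper's dual process model. The distributions `p` and `q` and all function names are illustrative assumptions, and how the paper assigns the two terms to its two processes is not described in this abstract.

```python
import math

def shannon_entropy(p):
    """H(p) = -sum_i p_i * log(p_i), in nats; zero-probability terms are skipped."""
    return -sum(pi * math.log(pi) for pi in p if pi > 0)

def kl_divergence(p, q):
    """KL(p || q) = sum_i p_i * log(p_i / q_i); assumes q_i > 0 wherever p_i > 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)

def cross_entropy(p, q):
    """H(p, q) = -sum_i p_i * log(q_i); assumes q_i > 0 wherever p_i > 0."""
    return -sum(pi * math.log(qi) for pi, qi in zip(p, q) if pi > 0)

# Hypothetical target distribution p and model prediction q (illustrative values).
p = [0.7, 0.2, 0.1]
q = [0.5, 0.3, 0.2]

h_pq = cross_entropy(p, q)
decomposed = shannon_entropy(p) + kl_divergence(p, q)

print(f"cross-entropy H(p, q):  {h_pq:.6f}")
print(f"H(p) + KL(p || q):      {decomposed:.6f}")
```

The two printed values agree up to floating-point rounding, which is why minimizing cross-entropy can be read as jointly addressing a divergence term and an entropy term.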

research
06/28/2022

On the Rényi Cross-Entropy

The Rényi cross-entropy measure between two distributions, a generalizat...
research
07/12/2021

SoftHebb: Bayesian inference in unsupervised Hebbian soft winner-take-all networks

State-of-the-art artificial neural networks (ANNs) require labelled data...
research
06/08/2020

The Golden Ratio of Learning and Momentum

Gradient descent has been a central training principle for artificial ne...
research
03/22/2022

A Quantitative Comparison between Shannon and Tsallis Havrda Charvat Entropies Applied to Cancer Outcome Prediction

In this paper, we propose to quantitatively compare loss functions based...
research
10/21/2019

Model Order Selection in DoA Scenarios via Cross-Entropy based Machine Learning Techniques

In this paper, we present a machine learning approach for estimating the...
research
11/22/2012

A hybrid cross entropy algorithm for solving dynamic transit network design problem

This paper proposes a hybrid multiagent learning algorithm for solving t...
research
11/12/2019

Combinatorial Models of Cross-Country Dual Meets: What is a Big Victory?

Combinatorial/probabilistic models for cross-country dual-meets are prop...
