Empirical Loss Landscape Analysis of Neural Network Activation Functions

06/28/2023
by   Anna Sergeevna Bosman, et al.
0

Activation functions play a significant role in neural network design by enabling non-linearity. The choice of activation function was previously shown to influence the properties of the resulting loss landscape. Understanding the relationship between activation functions and loss landscape properties is important for neural architecture and training algorithm design. This study empirically investigates neural network loss landscapes associated with hyperbolic tangent, rectified linear unit, and exponential linear unit activation functions. Rectified linear unit is shown to yield the most convex loss landscape, and exponential linear unit is shown to yield the least flat loss landscape, and to exhibit superior generalisation performance. The presence of wide and narrow valleys in the loss landscape is established for all activation functions, and the narrow valleys are shown to correlate with saturated neurons and implicitly regularised network configurations.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/05/2021

An Analysis of State-of-the-art Activation Functions For Supervised Deep Neural Network

This paper provides an analysis of state-of-the-art activation functions...
research
10/03/2018

Weighted Sigmoid Gate Unit for an Activation Function of Deep Neural Network

An activation function has crucial role in a deep neural network. A si...
research
11/11/2020

Domain Wall Leaky Integrate-and-Fire Neurons with Shape-Based Configurable Activation Functions

Complementary metal oxide semiconductor (CMOS) devices display volatile ...
research
02/07/2020

Ill-Posedness and Optimization Geometry for Nonlinear Neural Network Training

In this work we analyze the role nonlinear activation functions play at ...
research
04/14/2023

The R-mAtrIx Net

We provide a novel Neural Network architecture that can: i) output R-mat...
research
06/07/2013

Non-constant bounded holomorphic functions of hyperbolic numbers - Candidates for hyperbolic activation functions

The Liouville theorem states that bounded holomorphic complex functions ...
research
01/16/2013

Saturating Auto-Encoders

We introduce a simple new regularizer for auto-encoders whose hidden-uni...

Please sign up or login with your details

Forgot password? Click here to reset