Padé Activation Units: End-to-end Learning of Flexible Activation Functions in Deep Networks

07/15/2019
by   Alejandro Molina, et al.
8

The performance of deep network learning strongly depends on the choice of the non-linear activation function associated with each neuron. However, deciding on the best activation is non-trivial and the choice depends on the architecture, hyper-parameters, and even on the dataset. Typically these activations are fixed by hand before training. Here, we demonstrate how to eliminate the reliance on first picking fixed activation functions by using flexible parametric rational functions instead. The resulting Padé Activation Units (PAUs) can both approximate common activation functions and also learn new ones while providing compact representations. Our empirical evidence shows that end-to-end learning deep networks with PAUs can increase the predictive performance and reduce the training time of common deep architectures. Moreover, PAUs pave the way to approximations with provable robustness. The source code can be found at https://github.com/ml-research/pau

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2021

SMU: smooth activation function for deep networks using smoothing maximum technique

Deep learning researchers have a keen interest in proposing two new nove...
research
08/17/2022

Restructurable Activation Networks

Is it possible to restructure the non-linear activation functions in a d...
research
11/11/2015

Piecewise Linear Activation Functions For More Efficient Deep Networks

This submission has been withdrawn by arXiv administrators because it is...
research
07/26/2020

Regularized Flexible Activation Function Combinations for Deep Neural Networks

Activation in deep neural networks is fundamental to achieving non-linea...
research
02/20/2020

Avoiding Kernel Fixed Points: Computing with ELU and GELU Infinite Networks

Analysing and computing with Gaussian processes arising from infinitely ...
research
05/03/2022

Attentive activation function for improving end-to-end spoofing countermeasure systems

The main objective of the spoofing countermeasure system is to detect th...
research
06/19/2023

Learn to Accumulate Evidence from All Training Samples: Theory and Practice

Evidential deep learning, built upon belief theory and subjective logic,...

Please sign up or login with your details

Forgot password? Click here to reset