Go Small and Similar: A Simple Output Decay Brings Better Performance

06/12/2021
by   Xuan Cheng, et al.

Regularization and data augmentation methods have been widely used and have become increasingly indispensable in deep learning training. Researchers who devote themselves to this area have considered various possibilities, but so far there has been little discussion about regularizing the outputs of the model. This paper begins with the empirical observation that better performance is significantly associated with output distributions that have smaller average values and variances. By audaciously assuming there is causality involved, we propose a novel regularization term, called Output Decay, that forces the model to assign smaller and similar output values to each class. Though counter-intuitive, such a small modification results in a remarkable improvement in performance. Extensive experiments demonstrate the wide applicability, versatility, and compatibility of Output Decay.
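
The abstract does not give the exact form of the Output Decay term, so the following is only a minimal sketch of the general idea, not the authors' definition. It assumes the penalty is the mean squared logit, which equals the squared mean plus the variance of the outputs and therefore pushes both toward zero. The names `output_penalty`, `regularized_loss`, and the coefficient `coeff` are hypothetical.

```python
import math


def cross_entropy(logits, target):
    """Cross-entropy for one example, computed stably from raw logits."""
    m = max(logits)
    log_sum_exp = m + math.log(sum(math.exp(z - m) for z in logits))
    return log_sum_exp - logits[target]


def output_penalty(logits):
    """Hypothetical output penalty (an assumption, not the paper's formula).

    Mean of squared logits = mean(logits)**2 + variance(logits),
    so minimizing it favors outputs that are both small and similar.
    """
    return sum(z * z for z in logits) / len(logits)


def regularized_loss(logits, target, coeff=0.01):
    """Task loss plus the assumed output penalty, weighted by coeff."""
    return cross_entropy(logits, target) + coeff * output_penalty(logits)
```

With `coeff=0.0` this reduces to plain cross-entropy; a larger `coeff` trades confidence on the target class for smaller, more uniform outputs.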

research
10/06/2022

A Better Way to Decay: Proximal Gradient Training Algorithms for Neural Nets

Weight decay is one of the most widely used forms of regularization in d...
research
05/25/2022

Augmentation-induced Consistency Regularization for Classification

Deep neural networks have become popular in many supervised learning tas...
research
11/14/2017

Fixing Weight Decay Regularization in Adam

We note that common implementations of adaptive gradient algorithms, suc...
research
04/07/2022

The Effects of Regularization and Data Augmentation are Class Dependent

Regularization is a fundamental technique to prevent over-fitting and to...
research
06/20/2022

When Does Re-initialization Work?

Re-initializing a neural network during training has been observed to im...
research
11/14/2019

Understanding the Disharmony between Weight Normalization Family and Weight Decay: ε-shifted L_2 Regularizer

The merits of fast convergence and potentially better performance of the...
research
03/21/2018

Learning the Localization Function: Machine Learning Approach to Fingerprinting Localization

Considered as a data-driven approach, Fingerprinting Localization Soluti...
