Towards Improving Generalization of Deep Networks via Consistent Normalization

08/31/2019
by   Aojun Zhou, et al.
0

Batch Normalization (BN) was shown to accelerate training and improve generalization of Convolutional Neural Networks (ConvNets), which typically use the Conv-BN couple as building block. However, this work shows a common phenomenon that the Conv-BN module does not necessarily outperform the networks trained without using BN, especially when data augmentation is presented in training. We find that this phenomenon occurs because there is inconsistency between the distribution of the augmented data and that of the normalized representation. To address this issue, we propose Consistent Normalization (CN) that not only retains the advantages of the existing normalization methods, but also achieves state-of-the-art performance on various tasks including image classification, segmentation, and machine translation. The code will be released to facilitate reproducibility.

READ FULL TEXT
research
11/21/2019

Rethinking Normalization and Elimination Singularity in Neural Networks

In this paper, we study normalization methods for neural networks from t...
research
03/25/2019

Weight Standardization

In this paper, we propose Weight Standardization (WS) to accelerate deep...
research
02/29/2020

Channel Equilibrium Networks for Learning Deep Representation

Convolutional Neural Networks (CNNs) are typically constructed by stacki...
research
04/22/2019

Switchable Whitening for Deep Representation Learning

Normalization methods are essential components in convolutional neural n...
research
03/27/2019

Understanding Unconventional Preprocessors in Deep Convolutional Neural Networks for Face Identification

Deep networks have achieved huge successes in application domains like o...
research
10/21/2021

A scale invariant ranking function for learning-to-rank: a real-world use case

Nowadays, Online Travel Agencies provide the main service for booking ho...
research
05/05/2022

InvNorm: Domain Generalization for Object Detection in Gastrointestinal Endoscopy

Domain Generalization is a challenging topic in computer vision, especia...

Please sign up or login with your details

Forgot password? Click here to reset