Understanding and Improving Convolutional Neural Networks via Concatenated Rectified Linear Units

03/16/2016
by   Wenling Shang, et al.
0

Recently, convolutional neural networks (CNNs) have been used as a powerful tool to solve many problems of machine learning and computer vision. In this paper, we aim to provide insight on the property of convolutional neural networks, as well as a generic method to improve the performance of many CNN architectures. Specifically, we first examine existing CNN models and observe an intriguing property that the filters in the lower layers form pairs (i.e., filters with opposite phase). Inspired by our observation, we propose a novel, simple yet effective activation scheme called concatenated ReLU (CRelu) and theoretically analyze its reconstruction property in CNNs. We integrate CRelu into several state-of-the-art CNN architectures and demonstrate improvement in their recognition performance on CIFAR-10/100 and ImageNet datasets with fewer trainable parameters. Our results suggest that better understanding of the properties of CNNs can lead to significant performance improvement with a simple modification.

READ FULL TEXT

page 1

page 8

page 17

research
05/20/2018

Wavelet Convolutional Neural Networks

Spatial and spectral approaches are two major approaches for image proce...
research
06/06/2021

Highlighting the Importance of Reducing Research Bias and Carbon Emissions in CNNs

Convolutional neural networks (CNNs) have become commonplace in addressi...
research
11/23/2016

Multigrid Neural Architectures

We propose a multigrid extension of convolutional neural networks (CNNs)...
research
06/18/2023

Learn to Enhance the Negative Information in Convolutional Neural Network

This paper proposes a learnable nonlinear activation mechanism specifica...
research
12/19/2014

Generative Modeling of Convolutional Neural Networks

The convolutional neural networks (CNNs) have proven to be a powerful to...
research
05/08/2018

Learning on the Edge: Explicit Boundary Handling in CNNs

Convolutional neural networks (CNNs) handle the case where filters exten...
research
11/30/2018

Projection Convolutional Neural Networks for 1-bit CNNs via Discrete Back Propagation

The advancement of deep convolutional neural networks (DCNNs) has driven...

Please sign up or login with your details

Forgot password? Click here to reset