DropCluster: A structured dropout for convolutional networks

02/07/2020
by   Liyan Chen, et al.
16

Dropout as a regularizer in deep neural networks has been less effective in convolutional layers than in fully connected layers. This is due to the fact that dropout drops features randomly. When features are spatially correlated as in the case of convolutional layers, information about the dropped pixels can still propagate to the next layers via neighboring pixels. In order to address this problem, more structured forms of dropout have been proposed. A drawback of these methods is that they do not adapt to the data. In this work, we introduce a novel structured regularization for convolutional layers, which we call DropCluster. Our regularizer relies on data-driven structure. It finds clusters of correlated features in convolutional layer outputs and drops the clusters randomly at each iteration. The clusters are learned and updated during model training so that they adapt both to the data and to the model weights. Our experiments on the ResNet-50 architecture demonstrate that our approach achieves better performance than DropBlock or other existing structured dropout variants. We also demonstrate the robustness of our approach when the size of training data is limited and when there is corruption in the data at test time.

READ FULL TEXT

page 3

page 4

page 6

page 8

research
10/30/2018

DropBlock: A regularization method for convolutional networks

Deep neural networks often work well when they are over-parameterized an...
research
07/27/2023

R-Block: Regularized Block of Dropout for convolutional networks

Dropout as a regularization technique is widely used in fully connected ...
research
10/23/2018

DropFilter: Dropout for Convolutions

Using a large number of parameters , deep neural networks have achieved ...
research
05/13/2022

Structural Dropout for Model Width Compression

Existing ML models are known to be highly over-parametrized, and use sig...
research
12/02/2020

Data-driven Analysis of Turbulent Flame Images

Turbulent premixed flames are important for power generation using gas t...
research
07/31/2018

Using Feature Grouping as a Stochastic Regularizer for High-Dimensional Noisy Data

The use of complex models --with many parameters-- is challenging with h...
research
12/22/2014

Learning Compact Convolutional Neural Networks with Nested Dropout

Recently, nested dropout was proposed as a method for ordering represent...

Please sign up or login with your details

Forgot password? Click here to reset