DropBlock: A regularization method for convolutional networks

10/30/2018
by   Golnaz Ghiasi, et al.
12

Deep neural networks often work well when they are over-parameterized and trained with a massive amount of noise and regularization, such as weight decay and dropout. Although dropout is widely used as a regularization technique for fully connected layers, it is often less effective for convolutional layers. This lack of success of dropout for convolutional layers is perhaps due to the fact that activation units in convolutional layers are spatially correlated so information can still flow through convolutional networks despite dropout. Thus a structured form of dropout is needed to regularize convolutional networks. In this paper, we introduce DropBlock, a form of structured dropout, where units in a contiguous region of a feature map are dropped together. We found that applying DropbBlock in skip connections in addition to the convolution layers increases the accuracy. Also, gradually increasing number of dropped units during training leads to better accuracy and more robust to hyperparameter choices. Extensive experiments show that DropBlock works better than dropout in regularizing convolutional networks. On ImageNet classification, ResNet-50 architecture with DropBlock achieves 78.13% accuracy, which is more than 1.6% improvement on the baseline. On COCO detection, DropBlock improves Average Precision of RetinaNet from 36.8% to 38.4%.

READ FULL TEXT

page 2

page 7

research
02/07/2020

DropCluster: A structured dropout for convolutional networks

Dropout as a regularizer in deep neural networks has been less effective...
research
07/27/2023

R-Block: Regularized Block of Dropout for convolutional networks

Dropout as a regularization technique is widely used in fully connected ...
research
10/23/2018

DropFilter: Dropout for Convolutions

Using a large number of parameters , deep neural networks have achieved ...
research
08/24/2019

Don't ignore Dropout in Fully Convolutional Networks

Data for Image segmentation models can be costly to obtain due to the pr...
research
11/16/2018

Residual Convolutional Neural Network Revisited with Active Weighted Mapping

In visual recognition, the key to the performance improvement of ResNet ...
research
03/29/2021

FocusedDropout for Convolutional Neural Network

In convolutional neural network (CNN), dropout cannot work well because ...
research
12/02/2020

Data-driven Analysis of Turbulent Flame Images

Turbulent premixed flames are important for power generation using gas t...

Please sign up or login with your details

Forgot password? Click here to reset