Data-Driven Sparse Structure Selection for Deep Neural Networks

07/05/2017
by   Zehao Huang, et al.
0

Deep convolutional neural networks have liberated its extraordinary power on various tasks. However, it is still very challenging to deploy state-of-the-art models into real-world applications due to their high computational complexity. How can we design a compact and effective network without massive experiments and expert knowledge? In this paper, we propose a simple and effective framework to learn and prune deep models in an end-to-end manner. In our framework, a new type of parameter -- scaling factor is first introduced to scale the outputs of specific structures, such as neurons, groups or residual blocks. Then we add sparsity regularizations on these factors, and solve this optimization problem by a modified stochastic Accelerated Proximal Gradient (APG) method. By forcing some of the factors to zero, we can safely remove the corresponding structures, thus prune the unimportant parts of a CNN. Compared with other structure selection methods that may need thousands of trials or iterative fine-tuning, our method is trained fully end-to-end in one training pass without bells and whistles. We evaluate our method, Sparse Structure Selection with two state-of-the-art CNNs ResNet and ResNeXt, and demonstrate very promising results with adaptive depth and width selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/22/2019

Towards Optimal Structured CNN Pruning via Generative Adversarial Learning

Structured pruning of filters or neurons has received increased focus fo...
research
08/22/2017

Learning Efficient Convolutional Networks through Network Slimming

The deployment of deep convolutional neural networks (CNNs) in many real...
research
07/02/2023

A Proximal Algorithm for Network Slimming

As a popular channel pruning method for convolutional neural networks (C...
research
08/27/2017

Imbalanced Malware Images Classification: a CNN based Approach

Deep convolutional neural networks (CNNs) can be applied to malware bina...
research
11/16/2016

Cost-Sensitive Deep Learning with Layer-Wise Cost Estimation

While deep neural networks have succeeded in several visual applications...
research
07/08/2016

Enlightening Deep Neural Networks with Knowledge of Confounding Factors

Deep learning techniques have demonstrated significant capacity in model...
research
12/29/2019

Improving Deep Neuroevolution via Deep Innovation Protection

Evolutionary-based optimization approaches have recently shown promising...

Please sign up or login with your details

Forgot password? Click here to reset