A Main/Subsidiary Network Framework for Simplifying Binary Neural Network

12/11/2018
by   Yinghao Xu, et al.
0

To reduce memory footprint and run-time latency, techniques such as neural network pruning and binarization have been explored separately. However, it is unclear how to combine the best of the two worlds to get extremely small and efficient models. In this paper, we, for the first time, define the filter-level pruning problem for binary neural networks, which cannot be solved by simply migrating existing structural pruning methods for full-precision models. A novel learning-based approach is proposed to prune filters in our main/subsidiary network framework, where the main network is responsible for learning representative features to optimize the prediction performance, and the subsidiary component works as a filter selector on the main network. To avoid gradient mismatch when training the subsidiary component, we propose a layer-wise and bottom-up scheme. We also provide the theoretical and experimental comparison between our learning-based and greedy rule-based methods. Finally, we empirically demonstrate the effectiveness of our approach applied on several binary models, including binarized NIN, VGG-11, and ResNet-18, on various image classification datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/03/2023

Structured Network Pruning by Measuring Filter-wise Interactions

Structured network pruning is a practical approach to reduce computation...
research
06/23/2023

Binary domain generalization for sparsifying binary neural networks

Binary neural networks (BNNs) are an attractive solution for developing ...
research
06/08/2020

A Framework for Neural Network Pruning Using Gibbs Distributions

Neural network pruning is an important technique for creating efficient ...
research
10/01/2018

Layer-compensated Pruning for Resource-constrained Convolutional Neural Networks

Resource-efficient convolution neural networks enable not only the intel...
research
05/15/2019

Dynamic Neural Network Channel Execution for Efficient Training

Existing methods for reducing the computational burden of neural network...
research
11/30/2022

Optimizing Explanations by Network Canonization and Hyperparameter Search

Explainable AI (XAI) is slowly becoming a key component for many AI appl...
research
10/26/2021

Defensive Tensorization

We propose defensive tensorization, an adversarial defence technique tha...

Please sign up or login with your details

Forgot password? Click here to reset