Decomposing Convolutional Neural Networks into Reusable and Replaceable Modules

10/11/2021
by   Rangeet Pan, et al.
0

Training from scratch is the most common way to build a Convolutional Neural Network (CNN) based model. What if we can build new CNN models by reusing parts from previously build CNN models? What if we can improve a CNN model by replacing (possibly faulty) parts with other parts? In both cases, instead of training, can we identify the part responsible for each output class (module) in the model(s) and reuse or replace only the desired output classes to build a model? Prior work has proposed decomposing dense-based networks into modules (one for each output class) to enable reusability and replaceability in various scenarios. However, this work is limited to the dense layers and based on the one-to-one relationship between the nodes in consecutive layers. Due to the shared architecture in the CNN model, prior work cannot be adapted directly. In this paper, we propose to decompose a CNN model used for image classification problems into modules for each output class. These modules can further be reused or replaced to build a new model. We have evaluated our approach with CIFAR-10, CIFAR-100, and ImageNet tiny datasets with three variations of ResNet models and found that enabling decomposition comes with a small cost (2.38 0.81 reusing or replacing modules can be done with a 2.3 accuracy. Furthermore, reusing and replacing these modules reduces CO2e emission by  37 times compared to training the model from scratch.

READ FULL TEXT
research
06/04/2019

Visual Tree Convolutional Neural Network in Image Classification

In image classification, Convolutional Neural Network(CNN) models have a...
research
12/09/2022

Decomposing a Recurrent Neural Network into Modules for Enabling Reusability and Replacement

Can we take a recurrent neural network (RNN) trained to translate betwee...
research
09/28/2017

B-CNN: Branch Convolutional Neural Network for Hierarchical Classification

Convolutional Neural Network (CNN) image classifiers are traditionally d...
research
11/26/2017

JPEG Steganalysis Based on DenseNet

Current research has indicated that convolution neural networks (CNNs) c...
research
11/25/2017

Gradually Updated Neural Networks for Large-Scale Image Recognition

We present a simple yet effective neural network architecture for image ...
research
07/20/2020

HPIPE: Heterogeneous Layer-Pipelined and Sparse-Aware CNN Inference for FPGAs

We present both a novel Convolutional Neural Network (CNN) accelerator a...
research
08/30/2017

Efficient Convolutional Network Learning using Parametric Log based Dual-Tree Wavelet ScatterNet

We propose a DTCWT ScatterNet Convolutional Neural Network (DTSCNN) form...

Please sign up or login with your details

Forgot password? Click here to reset