PAC-Net: A Model Pruning Approach to Inductive Transfer Learning

06/12/2022
by   Sanghoon Myung, et al.
0

Inductive transfer learning aims to learn from a small amount of training data for the target task by utilizing a pre-trained model from the source task. Most strategies that involve large-scale deep learning models adopt initialization with the pre-trained model and fine-tuning for the target task. However, when using over-parameterized models, we can often prune the model without sacrificing the accuracy of the source task. This motivates us to adopt model pruning for transfer learning with deep learning models. In this paper, we propose PAC-Net, a simple yet effective approach for transfer learning based on pruning. PAC-Net consists of three steps: Prune, Allocate, and Calibrate (PAC). The main idea behind these steps is to identify essential weights for the source task, fine-tune on the source task by updating the essential weights, and then calibrate on the target task by updating the remaining redundant weights. Under the various and extensive set of inductive transfer learning experiments, we show that our method achieves state-of-the-art performance by a large margin.

READ FULL TEXT

page 7

page 9

page 13

research
03/02/2021

TransTailor: Pruning the Pre-trained Model for Improved Transfer Learning

The increasing of pre-trained models has significantly facilitated the p...
research
02/05/2018

Explicit Inductive Bias for Transfer Learning with Convolutional Networks

In inductive transfer learning, fine-tuning pre-trained convolutional ne...
research
10/02/2018

Target Aware Network Adaptation for Efficient Representation Learning

This paper presents an automatic network adaptation method that finds a ...
research
08/19/2023

Disposable Transfer Learning for Selective Source Task Unlearning

Transfer learning is widely used for training deep neural networks (DNN)...
research
01/19/2022

A Review of Deep Transfer Learning and Recent Advancements

A successful deep learning model is dependent on extensive training data...
research
03/02/2023

Optimal transfer protocol by incremental layer defrosting

Transfer learning is a powerful tool enabling model training with limite...
research
03/25/2021

SMILE: Self-Distilled MIxup for Efficient Transfer LEarning

To improve the performance of deep learning, mixup has been proposed to ...

Please sign up or login with your details

Forgot password? Click here to reset