Tune-Mode ConvBN Blocks For Efficient Transfer Learning

05/19/2023
by   Kaichao You, et al.
0

Convolution-BatchNorm (ConvBN) blocks are integral components in various computer vision tasks and other domains. A ConvBN block can operate in three modes: Train, Eval, and Deploy. While the Train mode is indispensable for training models from scratch, the Eval mode is suitable for transfer learning and model validation, and the Deploy mode is designed for the deployment of models. This paper focuses on the trade-off between stability and efficiency in ConvBN blocks: Deploy mode is efficient but suffers from training instability; Eval mode is widely used in transfer learning but lacks efficiency. To solve the dilemma, we theoretically reveal the reason behind the diminished training stability observed in the Deploy mode. Subsequently, we propose a novel Tune mode to bridge the gap between Eval mode and Deploy mode. The proposed Tune mode is as stable as Eval mode for transfer learning, and its computational efficiency closely matches that of the Deploy mode. Through extensive experiments in both object detection and classification tasks, carried out across various datasets and model architectures, we demonstrate that the proposed Tune mode does not hurt the original performance while significantly reducing GPU memory footprint and training time, thereby contributing an efficient solution to transfer learning with convolutional networks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/06/2021

UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks

UniLog: Deploy One Model and Specialize it for All Log Analysis Tasks...
research
03/24/2021

Factors of Influence for Transfer Learning across Diverse Appearance Domains and Task Types

Transfer learning enables to re-use knowledge learned on a source task t...
research
11/16/2018

Domain Adaptive Transfer Learning with Specialist Models

Transfer learning is a widely used method to build high performing compu...
research
01/21/2020

Transfer Learning using Neural Ordinary Differential Equations

A concept of using Neural Ordinary Differential Equations(NODE) for Tran...
research
12/05/2020

Transfer Learning for Human Activity Recognition using Representational Analysis of Neural Networks

Human activity recognition (HAR) research has increased in recent years ...
research
12/09/2020

Transfer Learning for Efficient Iterative Safety Validation

Safety validation is important during the development of safety-critical...
research
12/09/2022

Transfer Learning Enhanced DeepONet for Long-Time Prediction of Evolution Equations

Deep operator network (DeepONet) has demonstrated great success in vario...

Please sign up or login with your details

Forgot password? Click here to reset