Transfer Learning Between Different Architectures Via Weights Injection

01/07/2021
by   Maciej A. Czyzewski, et al.
0

This work presents a naive algorithm for parameter transfer between different architectures with a computationally cheap injection technique (which does not require data). The primary objective is to speed up the training of neural networks from scratch. It was found in this study that transferring knowledge from any architecture was superior to Kaiming and Xavier for initialization. In conclusion, the method presented is found to converge faster, which makes it a drop-in replacement for classical methods. The method involves: 1) matching: the layers of the pre-trained model with the targeted model; 2) injection: the tensor is transformed into a desired shape. This work provides a comparison of similarity between the current SOTA architectures (ImageNet), by utilising TLI (Transfer Learning by Injection) score.

READ FULL TEXT
research
12/28/2022

Breaking the Architecture Barrier: A Method for Efficient Knowledge Transfer Across Networks

Transfer learning is a popular technique for improving the performance o...
research
03/03/2020

Exploring the Efficacy of Transfer Learning in Mining Image-Based Software Artifacts

Transfer learning allows us to train deep architectures requiring a larg...
research
11/18/2019

Towards Making Deep Transfer Learning Never Hurt

Transfer learning have been frequently used to improve deep neural netwo...
research
06/20/2021

Multirate Training of Neural Networks

We propose multirate training of neural networks: partitioning neural ne...
research
11/18/2019

A Code injection Method for Rapid Docker Image Building

Docker images are built by layers, yet the current implementation has ma...
research
07/08/2020

Quality Classification of Defective Parts from Injection Moulding

This report examines machine learning algorithms for detecting short for...
research
09/10/2019

Accelerating Training using Tensor Decomposition

Tensor decomposition is one of the well-known approaches to reduce the l...

Please sign up or login with your details

Forgot password? Click here to reset