Δ-Networks for Efficient Model Patching

03/26/2023
by Chaitanya Devaguptapu, et al.

Models pre-trained on large-scale datasets are often finetuned to support new tasks and datasets as they arrive over time. This requires storing a separate copy of the model for each task on which the pre-trained model is finetuned. Building on recent model patching work, we propose Δ-Patching, an approach to finetuning neural network models efficiently without storing model copies. We propose a simple, lightweight method called Δ-Networks to achieve this objective. Our comprehensive experiments across settings and architecture variants show that Δ-Networks outperform earlier model patching work while requiring only a fraction of the parameters to be trained. We also show that this approach can be used in other problem settings, such as transfer learning and zero-shot domain adaptation, and for other tasks, such as detection and segmentation.

