Model Reprogramming: Resource-Efficient Cross-Domain Machine Learning

02/22/2022
by   Pin-Yu Chen, et al.
0

In data-rich domains such as vision, language, and speech, deep learning prevails to deliver high-performance task-specific models and can even learn general task-agnostic representations for efficient finetuning to downstream tasks. However, deep learning in resource-limited domains still faces the following challenges including (i) limited data, (ii) constrained model development cost, and (iii) lack of adequate pre-trained models for effective finetuning. This paper introduces a new technique called model reprogramming to bridge this gap. Model reprogramming enables resource-efficient cross-domain machine learning by repurposing and reusing a well-developed pre-trained model from a source domain to solve tasks in a target domain without model finetuning, where the source and target domains can be vastly different. In many applications, model reprogramming outperforms transfer learning and training from scratch. This paper elucidates the methodology of model reprogramming, summarizes existing use cases, provides a theoretical explanation on the success of model reprogramming, and concludes with a discussion on open-ended research questions and opportunities. A list of model reprogramming studies is actively maintained and updated at https://github.com/IBM/model-reprogramming.

READ FULL TEXT
research
09/08/2018

Instance-based Deep Transfer Learning

Deep transfer learning has acquired significant research interest. It ma...
research
06/14/2019

MediaPipe: A Framework for Building Perception Pipelines

Building applications that perceive the world around them is challenging...
research
01/25/2023

One Model for All Domains: Collaborative Domain-Prefix Tuning for Cross-Domain NER

Cross-domain NER is a challenging task to address the low-resource probl...
research
06/09/2021

Neural Supervised Domain Adaptation by Augmenting Pre-trained Models with Random Units

Neural Transfer Learning (TL) is becoming ubiquitous in Natural Language...
research
03/21/2022

Domain Generalization by Mutual-Information Regularization with Pre-trained Models

Domain generalization (DG) aims to learn a generalized model to an unsee...
research
09/23/2022

Multi-Modal Cross-Domain Alignment Network for Video Moment Retrieval

As an increasingly popular task in multimedia information retrieval, vid...
research
05/07/2021

Unsupervised Cross-Domain Prerequisite Chain Learning using Variational Graph Autoencoders

Learning prerequisite chains is an essential task for efficiently acquir...

Please sign up or login with your details

Forgot password? Click here to reset