Frustratingly Easy Transferability Estimation

06/17/2021
by Long-Kai Huang, et al.

Transferability estimation has been an essential tool for selecting a pre-trained model, and which of its layers to transfer, so as to maximize performance on a target task and prevent negative transfer. Existing estimation algorithms either require intensive training on target tasks or have difficulty evaluating the transferability between layers. We propose a simple, efficient, and effective transferability measure named TransRate. With a single pass through the target data, TransRate measures transferability as the mutual information between the features of target examples extracted by a pre-trained model and their labels. We overcome the challenge of efficient mutual information estimation by resorting to the coding rate, which serves as an effective alternative to entropy. Theoretical analysis shows that TransRate is closely related to the performance after transfer learning. Despite its extraordinary simplicity (10 lines of code), TransRate performs remarkably well in extensive evaluations on 22 pre-trained models and 16 downstream tasks.
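The abstract does not give the formula, so the following is only a minimal sketch of the idea it describes: scoring features by a coding-rate surrogate for the mutual information between features and labels. It assumes the common rate-distortion form of the coding rate, 0.5 * logdet(I + Z^T Z / (n * eps)), and estimates mutual information as the rate of all features minus the class-weighted rate of each class's features. The function names, the distortion parameter `eps` and its default value, and the weighting by class frequency are illustrative assumptions and may differ from the authors' implementation.

```python
import numpy as np


def coding_rate(Z, eps=1e-4):
    """Coding-rate surrogate for entropy (assumed form):
    0.5 * logdet(I_d + Z^T Z / (n * eps)) for an (n, d) feature matrix Z."""
    n, d = Z.shape
    # slogdet is numerically safer than log(det(...)) for large d.
    _, logdet = np.linalg.slogdet(np.eye(d) + (Z.T @ Z) / (n * eps))
    return 0.5 * logdet


def transferability_score(Z, y, eps=1e-4):
    """Hypothetical helper: rate of all features minus the
    class-frequency-weighted rate of the features within each class."""
    Z = np.asarray(Z, dtype=float)
    y = np.asarray(y)
    # Center the features once over the whole target set.
    Z = Z - Z.mean(axis=0, keepdims=True)
    rate_all = coding_rate(Z, eps)
    rate_given_y = 0.0
    for c in np.unique(y):
        Zc = Z[y == c]
        # Weight each class by its share of the examples (an assumption).
        rate_given_y += (len(Zc) / len(Z)) * coding_rate(Zc, eps)
    return rate_all - rate_given_y
```

Here `Z` would hold the features produced by one forward pass of the pre-trained model over the target data, and `y` the integer target labels. Under these assumptions a higher score suggests features that are more informative about the labels. Because the per-class features are not re-centered, this particular sketch responds to classes occupying different directions in feature space rather than to a pure shift in class means.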


research
08/11/2023

Fast and Accurate Transferability Measurement by Evaluating Intra-class Feature Variance

Given a set of pre-trained models, how can we quickly and accurately fin...
research
08/03/2023

ETran: Energy-Based Transferability Estimation

This paper addresses the problem of ranking pre-trained models for objec...
research
02/27/2020

LEEP: A New Measure to Evaluate Transferability of Learned Representations

We introduce a new measure to evaluate the transferability of representa...
research
09/05/2023

Building a Winning Team: Selecting Source Model Ensembles using a Submodular Transferability Estimation Approach

Estimating the transferability of publicly available pretrained models t...
research
06/30/2020

Data-driven Regularization via Racecar Training for Generalizing Neural Networks

We propose a novel training approach for improving the generalization in...
research
10/14/2021

Omni-Training for Data-Efficient Deep Learning

Learning a generalizable deep model from a few examples in a short time ...
research
05/17/2022

Composing General Audio Representation by Fusing Multilayer Features of a Pre-trained Model

Many application studies rely on audio DNN models pre-trained on a large...
