An Information-Theoretic Approach to Semi-supervised Transfer Learning

06/11/2023
by Daniel Jakubovitz, et al.

Transfer learning is a valuable tool in deep learning as it allows propagating information from one "source dataset" to another "target dataset", especially in the case of a small number of training examples in the latter. Yet, discrepancies between the underlying distributions of the source and target data are commonplace and are known to have a substantial impact on algorithm performance. In this work we suggest novel information-theoretic approaches for the analysis of the performance of deep neural networks in the context of transfer learning. We focus on the task of semi-supervised transfer learning, in which unlabeled samples from the target dataset are available during network training on the source dataset. Our theory suggests that one may improve the transferability of a deep neural network by incorporating regularization terms on the target data based on information-theoretic quantities, namely the Mutual Information and the Lautum Information. We demonstrate the effectiveness of the proposed approaches in various semi-supervised transfer learning experiments.
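
For readers unfamiliar with the two quantities named above, the following is a minimal sketch of their standard textbook definitions for a small discrete joint distribution. Mutual information is the KL divergence from the joint distribution to the product of its marginals, and Lautum information is the same divergence with the arguments swapped. This is not the regularizer used in the paper, whose exact form is not given in this abstract; the function names and the toy joint tables are assumptions made purely for illustration.

```python
import math


def _marginals(joint):
    """Return the row marginal P_X and column marginal P_Y of a joint table."""
    px = [sum(row) for row in joint]
    py = [sum(col) for col in zip(*joint)]
    return px, py


def mutual_information(joint):
    """I(X;Y) = KL(P_XY || P_X P_Y), in nats."""
    px, py = _marginals(joint)
    total = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            if p > 0:  # terms with zero joint mass contribute nothing
                total += p * math.log(p / (px[i] * py[j]))
    return total


def lautum_information(joint):
    """L(X;Y) = KL(P_X P_Y || P_XY), in nats (arguments swapped vs. I)."""
    px, py = _marginals(joint)
    total = 0.0
    for i, row in enumerate(joint):
        for j, p in enumerate(row):
            q = px[i] * py[j]
            if q > 0:
                if p == 0:
                    # Product puts mass where the joint has none: divergence is infinite.
                    return math.inf
                total += q * math.log(q / p)
    return total


# Hypothetical toy distributions (not data from the paper).
independent = [[0.25, 0.25], [0.25, 0.25]]
dependent = [[0.4, 0.1], [0.1, 0.4]]

mi_ind = mutual_information(independent)
li_ind = lautum_information(independent)
mi_dep = mutual_information(dependent)
li_dep = lautum_information(dependent)
```

Both quantities vanish when the two variables are independent and are positive otherwise, but they are not equal in general, which is why they can lead to different regularization terms.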

