Adapted Deep Embeddings: A Synthesis of Methods for k-Shot Inductive Transfer Learning

05/22/2018
by   Tyler R. Scott, et al.
0

The focus in machine learning has branched beyond training classifiers on a single task to investigating how previously acquired knowledge in a source domain can be leveraged to facilitate learning in a related target domain, known as inductive transfer learning. Three active lines of research have independently explored transfer learning using neural networks. In weight transfer, a model trained on the source domain is used as an initialization point for a network to be trained on the target domain. In deep metric learning, the source domain is used to construct an embedding that captures class structure in both the source and target domains. In few-shot learning, the focus is on generalizing well in the target domain based on a limited number of labeled examples. We compare state-of-the-art methods from these three paradigms and also explore hybrid adapted-embedding methods that use limited target-domain data to fine tune embeddings constructed from source-domain data. We conduct a systematic comparison of methods in a variety of domains, varying the number of labeled instances available in the target domain (k), as well as the number of target-domain classes. We reach three principle conclusions: (1) Deep embeddings are far superior, compared to weight transfer, as a starting point for inter-domain transfer or model re-use (2) Our hybrid methods robustly outperform every few-shot learning and every deep metric learning method previously proposed, with a mean error reduction of 30 over state-of-the-art. (3) Among loss functions for discovering embeddings, the histogram loss (Ustinova & Lempitsky, 2016) is most robust. We hope our results will motivate a unification of research in weight transfer, deep metric learning, and few-shot learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2021

A new semi-supervised inductive transfer learning framework: Co-Transfer

In many practical data mining scenarios, such as network intrusion detec...
research
05/23/2023

Deep Transductive Transfer Learning for Automatic Target Recognition

One of the major obstacles in designing an automatic target recognition ...
research
10/15/2020

Self-training for Few-shot Transfer Across Extreme Task Differences

All few-shot learning techniques must be pre-trained on a large, labeled...
research
07/09/2020

n-Reference Transfer Learning for Saliency Prediction

Benefiting from deep learning research and large-scale datasets, salienc...
research
09/16/2019

Transfer learning for Remaining Useful Life Prediction Based on Consensus Self-Organizing Models

The traditional paradigm for developing machine prognostics usually reli...
research
10/28/2019

Evaluating Lottery Tickets Under Distributional Shifts

The Lottery Ticket Hypothesis suggests large, over-parameterized neural ...
research
04/08/2019

Transferring Knowledge Fragments for Learning Distance Metric from A Heterogeneous Domain

The goal of transfer learning is to improve the performance of target le...

Please sign up or login with your details

Forgot password? Click here to reset