Knowledge Transfer with Jacobian Matching

03/01/2018
by   Suraj Srinivas, et al.
0

Classical distillation methods transfer representations from a "teacher" neural network to a "student" network by matching their output activations. Recent methods also match the Jacobians, or the gradient of output activations with the input. However, this involves making some ad hoc decisions, in particular, the choice of the loss function. In this paper, we first establish an equivalence between Jacobian matching and distillation with input noise, from which we derive appropriate loss functions for Jacobian matching. We then rely on this analysis to apply Jacobian matching to transfer learning by establishing equivalence of a recent transfer learning procedure to distillation. We then show experimentally on standard image datasets that Jacobian-based penalties improve distillation, robustness to noisy inputs, and transfer learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/11/2019

Variational Information Distillation for Knowledge Transfer

Transferring knowledge from a teacher neural network pretrained on the s...
research
07/10/2020

Transformations between deep neural networks

We propose to test, and when possible establish, an equivalence between ...
research
03/31/2021

Knowledge Distillation By Sparse Representation Matching

Knowledge Distillation refers to a class of methods that transfers the k...
research
06/12/2020

Knowledge Distillation Meets Self-Supervision

Knowledge distillation, which involves extracting the "dark knowledge" f...
research
10/05/2022

On Neural Consolidation for Transfer in Reinforcement Learning

Although transfer learning is considered to be a milestone in deep reinf...
research
05/01/2020

Can a powerful neural network be a teacher for a weaker neural network?

The transfer learning technique is widely used to learning in one contex...
research
11/11/2022

Palm Vein Recognition via Multi-task Loss Function and Attention Layer

With the improvement of arithmetic power and algorithm accuracy of perso...

Please sign up or login with your details

Forgot password? Click here to reset