On the stepwise nature of self-supervised learning

03/27/2023
by   James B. Simon, et al.
0

We present a simple picture of the training process of self-supervised learning methods with joint embedding networks. We find that these methods learn their high-dimensional embeddings one dimension at a time in a sequence of discrete, well-separated steps. We arrive at this conclusion via the study of a linearized model of Barlow Twins applicable to the case in which the trained network is infinitely wide. We solve the training dynamics of this model from small initialization, finding that the model learns the top eigenmodes of a certain contrastive kernel in a stepwise fashion, and obtain a closed-form expression for the final learned representations. Remarkably, we then see the same stepwise learning phenomenon when training deep ResNets using the Barlow Twins, SimCLR, and VICReg losses. Our theory suggests that, just as kernel regression can be thought of as a model of supervised learning, kernel PCA may serve as a useful model of self-supervised learning.

READ FULL TEXT

page 14

page 15

research
09/29/2022

Joint Embedding Self-Supervised Learning in the Kernel Regime

The fundamental goal of self-supervised learning (SSL) is to produce use...
research
05/27/2023

Kernel-SSL: Kernel KL Divergence for Self-Supervised Learning

Contrastive learning usually compares one positive anchor sample with lo...
research
01/02/2020

Self-Supervised Learning of Generative Spin-Glasses with Normalizing Flows

Spin-glasses are universal models that can capture complex behavior of m...
research
05/26/2023

Unsupervised Embedding Quality Evaluation

Unsupervised learning has recently significantly gained in popularity, e...
research
09/06/2021

Training Deep Networks from Zero to Hero: avoiding pitfalls and going beyond

Training deep neural networks may be challenging in real world data. Usi...
research
07/28/2022

Self-supervised learning with rotation-invariant kernels

A major paradigm for learning image representations in a self-supervised...
research
06/16/2021

Nonequilibrium thermodynamics of self-supervised learning

Self-supervised learning (SSL) of energy based models has an intuitive r...

Please sign up or login with your details

Forgot password? Click here to reset