Momentum^2 Teacher: Momentum Teacher with Momentum Statistics for Self-Supervised Learning

01/19/2021
by   Zeming Li, et al.
0

In this paper, we present a novel approach, Momentum^2 Teacher, for student-teacher based self-supervised learning. The approach performs momentum update on both network weights and batch normalization (BN) statistics. The teacher's weight is a momentum update of the student, and the teacher's BN statistics is a momentum update of those in history. The Momentum^2 Teacher is simple and efficient. It can achieve the state of the art results (74.5%) under ImageNet linear evaluation protocol using small-batch size(, 128), without requiring large-batch training on special hardware like TPU or inefficient across GPU operation (, shuffling BN, synced BN). Our implementation and pre-trained models will be given on GitHub[https://github.com/zengarden/momentum2-teacher].

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/21/2021

Exponential Moving Average Normalization for Self-supervised and Semi-supervised Learning

We present a plug-in replacement for batch normalization (BN) called exp...
research
02/04/2023

MOMA:Distill from Self-Supervised Teachers

Contrastive Learning and Masked Image Modelling have demonstrated except...
research
04/26/2023

Hopfield model with planted patterns: a teacher-student self-supervised learning model

While Hopfield networks are known as paradigmatic models for memory stor...
research
07/15/2021

A Multimodal Machine Learning Framework for Teacher Vocal Delivery Evaluation

The quality of vocal delivery is one of the key indicators for evaluatin...
research
11/17/2022

Self-Supervised Visual Representation Learning via Residual Momentum

Self-supervised learning (SSL) approaches have shown promising capabilit...
research
12/06/2022

Self-Supervised Audio-Visual Speech Representations Learning By Multimodal Self-Distillation

In this work, we present a novel method, named AV2vec, for learning audi...
research
06/15/2020

Slowing Down the Weight Norm Increase in Momentum-based Optimizers

Normalization techniques, such as batch normalization (BN), have led to ...

Please sign up or login with your details

Forgot password? Click here to reset