Q-Match: Self-supervised Learning by Matching Distributions Induced by a Queue

02/10/2023
by   Thomas Mulc, et al.
0

In semi-supervised learning, student-teacher distribution matching has been successful in improving performance of models using unlabeled data in conjunction with few labeled samples. In this paper, we aim to replicate that success in the self-supervised setup where we do not have access to any labeled data during pre-training. We introduce our algorithm, Q-Match, and show it is possible to induce the student-teacher distributions without any knowledge of downstream classes by using a queue of embeddings of samples from the unlabeled dataset. We focus our study on tabular datasets and show that Q-Match outperforms previous self-supervised learning techniques when measuring downstream classification performance. Furthermore, we show that our method is sample efficient–in terms of both the labels required for downstream training and the amount of unlabeled data required for pre-training–and scales well to the sizes of both the labeled and unlabeled data.

READ FULL TEXT
research
08/14/2020

Semi-supervised learning using teacher-student models for vocal melody extraction

The lack of labeled data is a major obstacle in many music information r...
research
02/11/2021

SelfHAR: Improving Human Activity Recognition through Self-training with Unlabeled Data

Machine learning and deep learning have shown great promise in mobile se...
research
11/19/2022

Domain-Adaptive Self-Supervised Pre-Training for Face Body Detection in Drawings

Drawings are powerful means of pictorial abstraction and communication. ...
research
08/20/2022

Looking For A Match: Self-supervised Clustering For Automatic Doubt Matching In e-learning Platforms

Recently, e-learning platforms have grown as a place where students can ...
research
04/26/2022

ATST: Audio Representation Learning with Teacher-Student Transformer

Self-supervised learning (SSL) learns knowledge from a large amount of u...
research
08/19/2021

Improving Semi-Supervised Learning for Remaining Useful Lifetime Estimation Through Self-Supervision

RUL estimation suffers from a server data imbalance where data from mach...
research
11/14/2022

Self-training of Machine Learning Models for Liver Histopathology: Generalization under Clinical Shifts

Histopathology images are gigapixel-sized and include features and infor...

Please sign up or login with your details

Forgot password? Click here to reset