Self-Supervised Metric Learning in Multi-View Data: A Downstream Task Perspective

06/14/2021
by   Shulei Wang, et al.

Self-supervised metric learning is a successful approach to learning a distance from an unlabeled dataset. The resulting distance is broadly useful for improving distance-based downstream tasks, even when no information from those tasks is used in the metric learning stage. To gain insight into this approach, we develop a statistical framework to study theoretically how self-supervised metric learning can benefit downstream tasks in the context of multi-view data. Under this framework, we show that the target distance of metric learning satisfies several properties that are desirable for the downstream tasks. On the other hand, our investigation suggests that the target distance can be further improved by moderating the weight assigned to each direction. In addition, our analysis precisely characterizes the improvement that self-supervised metric learning brings to four commonly used downstream tasks: sample identification, two-sample testing, k-means clustering, and k-nearest neighbor classification. As a by-product, we propose a simple spectral method for self-supervised metric learning, which is computationally efficient and minimax optimal for estimating the target distance. Finally, numerical experiments support the theoretical results.
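
The abstract does not spell out the spectral method, so the following is only a minimal sketch of what a spectral approach to metric learning on two-view data could look like, not the paper's algorithm. It assumes each sample is observed in two views that share a low-dimensional signal and carry independent noise, estimates the signal subspace from the leading eigenvectors of the symmetrized cross-view covariance, and uses the projection onto that subspace as a Mahalanobis matrix. The function names `spectral_metric` and `learned_distance`, the synthetic data model, and the equal weighting of all retained directions are illustrative assumptions.

```python
import numpy as np


def spectral_metric(view_a, view_b, rank):
    """Hypothetical sketch: estimate a Mahalanobis matrix from paired views.

    view_a, view_b: arrays of shape (n, d); row i holds two views of sample i.
    rank: assumed dimension of the shared signal subspace.
    """
    a = view_a - view_a.mean(axis=0)
    b = view_b - view_b.mean(axis=0)
    n = a.shape[0]
    # Noise that is independent across views averages out of the cross-view
    # covariance, so its leading eigenvectors point along the shared signal.
    cross = (a.T @ b + b.T @ a) / (2 * n)
    eigvals, eigvecs = np.linalg.eigh(cross)
    top = np.argsort(eigvals)[::-1][:rank]
    basis = eigvecs[:, top]
    # Assumption: every retained direction gets weight 1 (a plain projection).
    return basis @ basis.T


def learned_distance(metric, x, y):
    """Distance sqrt((x - y)^T M (x - y)) under the learned matrix M."""
    diff = x - y
    return float(np.sqrt(max(diff @ metric @ diff, 0.0)))


# Synthetic two-view data (assumed model): the signal lives in the first
# two of ten coordinates, and each view adds independent unit-variance noise.
rng = np.random.default_rng(0)
n, d, r = 2000, 10, 2
signal = np.zeros((n, d))
signal[:, :r] = 2.0 * rng.standard_normal((n, r))
view_a = signal + rng.standard_normal((n, d))
view_b = signal + rng.standard_normal((n, d))

M = spectral_metric(view_a, view_b, rank=r)

origin = np.zeros(d)
signal_shift = np.zeros(d)
signal_shift[0] = 3.0   # differs from the origin along a signal direction
noise_shift = np.zeros(d)
noise_shift[4] = 3.0    # differs from the origin along a noise direction

print("distance along signal direction:", learned_distance(M, signal_shift, origin))
print("distance along noise direction:", learned_distance(M, noise_shift, origin))
```

In this sketch the learned distance keeps differences along the estimated signal directions and largely ignores differences along noise directions, which is the kind of behavior that would help distance-based downstream tasks such as k-nearest neighbor classification. Replacing the plain projection with direction-specific weights would be one way to explore the abstract's remark about moderating each direction's weight.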

