Time Does Tell: Self-Supervised Time-Tuning of Dense Image Representations

08/22/2023
by   Mohammadreza Salehi, et al.

Spatially dense self-supervised learning is a rapidly growing problem domain with promising applications for unsupervised segmentation and pretraining for dense downstream tasks. Despite the abundance of temporal data in the form of videos, this information-rich source has been largely overlooked. Our paper aims to address this gap by proposing a novel approach that incorporates temporal consistency in dense self-supervised learning. While methods designed solely for images face difficulties in achieving even the same performance on videos, our method improves the representation quality not only for videos but also for images. Our approach, which we call time-tuning, starts from image-pretrained models and fine-tunes them with a novel self-supervised temporal-alignment clustering loss on unlabeled videos. This effectively facilitates the transfer of high-level information from videos to image representations. Time-tuning improves the state-of-the-art by 8-10% for unsupervised semantic segmentation on videos and matches it for images. We believe this method paves the way for further self-supervised scaling by leveraging the abundant availability of videos. The implementation can be found here: https://github.com/SMSD75/Timetuning


