VICRegL: Self-Supervised Learning of Local Visual Features

10/04/2022
by   Adrien Bardes, et al.
12

Most recent self-supervised methods for learning image representations focus on either producing a global feature with invariance properties, or producing a set of local features. The former works best for classification tasks while the latter is best for detection and segmentation tasks. This paper explores the fundamental trade-off between learning local and global features. A new method called VICRegL is proposed that learns good global and local features simultaneously, yielding excellent performance on detection and segmentation tasks while maintaining good performance on classification tasks. Concretely, two identical branches of a standard convolutional net architecture are fed two differently distorted versions of the same image. The VICReg criterion is applied to pairs of global feature vectors. Simultaneously, the VICReg criterion is applied to pairs of local feature vectors occurring before the last pooling layer. Two local feature vectors are attracted to each other if their l2-distance is below a threshold or if their relative locations are consistent with a known geometric transformation between the two input images. We demonstrate strong performance on linear classification and segmentation transfer tasks. Code and pretrained models are publicly available at: https://github.com/facebookresearch/VICRegL

READ FULL TEXT

page 4

page 8

page 18

page 19

research
02/01/2023

Image-Based Vehicle Classification by Synergizing Features from Supervised and Self-Supervised Learning Paradigms

This paper introduces a novel approach to leverage features learned from...
research
08/01/2021

Self-supervised Learning with Local Attention-Aware Feature

In this work, we propose a novel methodology for self-supervised learnin...
research
10/07/2022

SVL-Adapter: Self-Supervised Adapter for Vision-Language Pretrained Models

Vision-language models such as CLIP are pretrained on large volumes of i...
research
11/12/2022

Prediction of Geometric Transformation on Cardiac MRI via Convolutional Neural Network

In the field of medical image, deep convolutional neural networks(ConvNe...
research
05/18/2022

Global Contrast Masked Autoencoders Are Powerful Pathological Representation Learners

Based on digital whole slide scanning technique, artificial intelligence...
research
11/16/2022

Hierarchical Dynamic Image Harmonization

Image harmonization is a critical task in computer vision, which aims to...

Please sign up or login with your details

Forgot password? Click here to reset